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The Effects of Transmission Delay in 
Four-Wire Teleconferencing* — 


By A. G. VARTABEDIAN 
(Manuscript received May 4, 1966) 


The effects of transmission delay upon the performance of a three-party 
teleconference were investigated using a problem-oriented task. The tele- 
conference was simulated in the laboratory using 4-wire telephone sets 
interconnected to form a three-party conference network without echo sources 
or echo control devices. The two experimental conditions were characterized 
by (a) a network whose three legs provided transmission delays of 600- 
mulliseconds, 300-milliseconds and no delay, and (b) a network each of 
whose three legs provided no transmission delay. It was found that: (t) 
time to complete the experimental task was 28 percent greater in the delay 
condition than in the no-delay condition, (11) the error rate was less in the 
delay condition than in the no-delay condition, (i11) time-per-trial decreased 
with successive trials in both delay conditions; time-per-trial was less in 
the no-delay condition than in the delay condition, and (wv) no chairman- 
ship pattern developed as a result of time delay in the network: Moreover, 
not one of the subjects reported having observed the existence of delay in 
the voice path. There were, however, more complaints of ‘talking together” 
an the delay condition. 


I. INTRODUCTION 


Two technological developments have recently entered the field of 
international communication. The first is the advent of the communi- 
* This paper is drawn from the author’s thesis which has been accepted by the 


University of Pennsylvania in partial fulfillment of the requirements for the de- 
gree of Master of Science. 
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cation satellite. The second is the now widespread ability to establish 
teleconferences (telephone conference calls).* Relatively little is di- 
rectly known about either of these by itself, let alone how they may 
interact. This paper represents an attempt to systematically experi- 
ment with time-delayed communication in the context of conference 
telephony. The research examines the influence of the delay factor 
upon human teleconferencing performance on a given task and also 
explores the development of group structure in a network having several 
time delays. 

In the present connection, the significant factor resulting from 
the use of a satellite is the relatively large amount of time delay intro- 
duced into the signal transmission. Riesz and Klemmer?, in a study 
of delayed conversation between two people, found that round-trip 
delays less than 600-milliseconds do not degrade the acceptability of 
the circuit. Mitchell? and Emling and Mitchell® have given the signifi- 
cant parameters of time delay for various types of satellites. Low 
orbit satellites (with a typical round-trip time delay of 100 milliseconds) 
and medium orbit satellites (with 190 milliseconds) should cause little 
trouble because of time delay. Hence, this paper investigates the effects 
on teleconferences resulting from signal delays introduced by a syn- 
chronous satellite (with a total round-trip time delay of 540 milli- 
seconds). 

We consider first the effects of time delay on the communication 
process. In a study of the length of reference phrases used in a conver- 
sation between two people to describe ambiguous figures over tele- 
phone circuits, Krauss and Weinheimer* ® found that the length of 
reference phrases decreased on each successive occasion the figure was 
mentioned, eventually reaching a lower limit of one word. In a related 
but yet unpublished study, they found that when the circuit was 
degraded by the introduction of voice-operated devices, the mean 
number of words to describe the ambiguous figure was higher on the 
first occasion and decreased at a slower rate than when the telephone 
circuit was of standard quality. 

Viewed collectively, these findings tend to indicate that a degraded 
circuit disrupts the communication process and that learning (evi- 
denced by the decreasing length of reference phrases on successive 
references) is greatly slowed down. Hence, one might expect that time 
delay in a teleconference network could also degrade the communica- 
tion and slow down the learning process. 


*In the context of this paper, a teleconference is a conference among at least 
three people using standard telephone sets. 
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In their study of naturally occurring conversations between two 
people, Riesz and Klemmer? found that subjects did not find round- 
trip delays of 600 and 1200 milliseconds objectionable. The criterion 
used to measure “objectionality” in that circuit was the number of 
times each of the subjects rejected the delayed circuit for a normal 
one. Riesz and Klemmer chose to deal with naturally occurring speech 
since other studies have shown that the subtle conversational diffi- 
culties produced by delay do not often occur in structured conversa- 
tions. 

As long as one person does all the talking, it is impossible to detect 
the presence of even very large delays. The natural speech simply 
arrives a few seconds late. Likewise, the presence of delay will be 
unnoticed in highly structured conversation, where it is agreed before- 
hand that one person will not begin talking until the other has stopped. 
There are simply longer pauses between talk spurts. 

Delay plays an increasingly significant role in a nonstructured 
conversation. When the communication involves elements of informa- 
tion exchange, persuasion, or negotiation, the quality of the communi- 
cation channel becomes more critical. For instance, in such teleconfer- 
enced conversations it is sometimes necessary to cut a person off for 
questioning or voicing objection. Consider a time-delayed conversation 
between two people, A and B where ¢ is the one-way delay of the 
circuit. When A cuts into B’s speech, he is heard by B ¢ seconds later 
at which time B stops talking (assuming B has zero reaction time). A, 
however, continues to hear B for another ¢ seconds. Hence, for period 
2t (the round-trip time delay of the circuit) both people are talking 
simultaneously. If there are more than two people in the conference 
and some (or all) of them are talking simultaneously, disorder may 
result. 

An alternative to the naturally occurring speech technique in evalu- 
ating a communication system is the use of a problem solving task. 
The advantage of using a specified task over the naturally occurring 
speech technique is that the nature of the communication can be more 
closely controlled. Conceivably, naturally occurring speech could range 
from idle chit-chat to high-level negotiations with the characteristics 
of the conversation varying greatly even within a given class of tele- 
conference. Consequently, it is desirable to test the teleconference 
facility using a task that simulates the pertinent characteristics of 
anticipated conferences. 

We next turn to the aspects of group behavior which apply to tele- 
conferences. Here a number of parameters are of importance such as 
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nature of conference, size of group, network configuration, and whether 
or not a chairman is designated. The nature of the communication will 
be the most important factor governing the proceedings of the confer- 
ence. A large conference in which one member gives instructions or 
information to all others will certainly be quite different from a con- 
ference in which three of four heads of state attempt to negotiate a 
settlement to an impending crisis. Consequently, it is difficult to inves- 
tigate teleconferences without first defining the nature (information 
exchange, lecture, problem solving, persuasion, negotiation, etc.) of 
the conference. 

The size of the group is a particularly significant parameter in 
teleconferences. It is of even greater importance than in face-to-face 
conferences because of the greater difficulty in identifying participants 
due to the lack of nonverbal cues (Sinaiko®). A group size of three 
is the simplest form of teleconference possible. Here, the possibility of 
a deadlock is minimized since the minority is an isolated single person. 

The third important parameter is the network configuration. Bave- 
las et al’ have found that two basic configurations are of importance: 
one having a common circuit and one having a central control. Com- 
mon circuit networks are those in which all participants can hear when 
any one of them talks. Central control networks are those over which 
all parties transmit to a single station which in turn can relay the 
message to all others. Note that the central control network lends 
itself naturally to strong chairmanship; while the chairman in a com- 
mon circuit network must depend on rules and protocol to maintain 
order. 

In a study on teleconferencing, Heise and Miller® found that an 
information collection task (completing a list of words; each subject 
having part of the list) was performed most rapidly using a common 
circuit network while a task requiring assembly plus coordination was 
performed most rapidly using a central control network with a chair- 
man in charge. They also found that the differences between networks 
become more pronounced as “noise” is introduced into the communica- 
tion channel. As “noise” they used white noise. 

Sinaiko® found that in one teleconference, the chairman did not 
add to the effectiveness of the meeting because he had no means of 
enforcing his decisions. He found also that when using four conferees 
and acceptable circuits it was not difficult to consistently identify each 
voice. In a large teleconference of 12 participants in which the issue of 
chairmanship was deliberately left vague, a chairman did seem to 
emerge. When the conferees were asked whether a chairman emerged 
they generally agreed that the man who volunteered to call the roll 
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during the first few minutes of the conference was regarded as the 
chairman. 

In a time-delayed multiparty teleconference in which a chairman 
has not been designated, it is of interest to determine whether or not 
any one position in the network would be favored for the emergence of 
a chairman. Consider the network depicted in Fig. 1 to be used in the 
present study. The three positions A, B, C are separated from a central 
point (telephone company conference operator) by one-way delays of 
600, 0, and 300 milliseconds, respectively. The psychologically relevant 
factor here is the round-trip delay between any two given persons in 
the network. 

One might anticipate (Bavelas® and Guetzkow and Simon!) that the 
person occupying the position of “relative centrality” would most 
likely emerge as chairman. The central position is that position which 
is closer than any other position to all other positions. Hence, using 
time delay as a measure of distance, this theory predicts that B will 
emerge as chairman. It is felt, however, that the delays depicted in the 
figure are insufficient to overshadow those personality factors which 
are thought to determine the normal development of group structure. 


II. PURPOSE 


An experiment was run to investigate the subjective reaction to large 
time delays (experienced in synchronous satellite communication) in a 
three-party teleconference. The network simulated a teleconference 
whose three legs consisted of one satellite link, two satellite links, and 
no satellite link, respectively. Each satellite link had a one-way delay 
of 300-milliseconds. The two experimental conditions were specified 
by: 

(a) Delay condition — a network, Fig. 1, in which one leg is delayed 






CONFERENCE 
OPERATOR 


Fig. 1— Network configuration. 
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600 milliseconds, another is delayed 300 milliseconds, and the third is 
undelayed. 

(6) No-delay condition —a network in which each of the three 

legs is undelayed. 
All other parameters remained the same between experimental con- 
ditions. No attempt was made to simulate echo or the effect of echo 
suppressors even though they would be expected to play an important 
role in actual satellite communication. 

The two experimental conditions were tested using a task which 
required the subjects to communicate over the teleconference network 
about ambiguous figures (in the sense that they do not evoke a com- 
mon reference phrase). Such ambiguous figures have been used pre- 
viously by Krauss and Weinheimer to test circuit quality in circuits 
containing voice switching devices and in circuits containing delay. Sub- 
jects were faced with the task of describing the figures appearing on each 
of their stimulus cards and identifying the figures which were common 
to all three of them. There were 22 such trials. 

With respect to the above defined task and the two experimental 
conditions it was hypothesized that: 

(t) Time-on-task will be greater in the delay condition than in the 
no-delay condition. 

(72) There will be no difference in accuracy on task between the 
delay and the no-delay condition. 

(vw) Time-per-trial will decrease with successive trials in both 
delay conditions. For each trial, time-per-trial will be greater in the 
delay condition than in the no-delay condition. 

(tv) No one position in the network is likely to emerge as a seat for 
a chairman in the delay condition. 


III. METHOD 


3.1 Procedure 


Upon arrival at the laboratory, subjects were introduced to each 
other. They were then given verbal instructions by the experimenter. 
Essentially they were told that they were to participate in a conference 
call as though each one of them were in a different country and that 
their voices would be transmitted to one another over simulated 
satellite circuits. They were told that the purpose of the study was to 
determine the effects of satellites on telephone communication. Sub- 
jects were not told whether or not their network contained delay. 
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After receiving instructions, subjects entered individual rooms and 
picked up telephones they found there. They were asked by the experi- 
menter over the telephone to give their name, room number, and a 
listing of letters that were associated with the task. All subjects could 
hear the questions and answers. This was to identify the names with 
the voices and to give the subjects a “feel” for the circuit. 

Subjects then began their task. If there were any misunderstandings, 
they were assisted by the experimenter. This rarely happened beyond 
the first minute of the task. After completing the task, subjects were 
individually interviewed by the experimenter and were then told not 
to discuss the details of the task with their friends since they might 
also be used as subjects later. 

Experimental sessions lasted about 30 minutes and were spaced 45 
minutes apart. Each group participated in only one delay condition. 


3.2 Hxperimental Task 


The three subjects, each in an acoustically isolated room, talked 
together over seemingly normal telephone sets. Before them on a 
table was a set of 22 cards each numbered, and mounted in a desk 
calendar holder. The holder kept the cards in order and presented 
only one card at a time. 

Before entering their rooms, the subjects were told that on each 
card of their set were stamped five nondescript figures. They were 
told that two of those figures appeared on all of the cards while the 
other three figures appeared only on their own cards. Fig. 2 shows 
a sample stimulus card. By describing the figures to each other, they 
were to find out which were the two figures held in common. After 
agreeing on the first figure on card 1, each subject read the letter under 
it. They next located the second figure on that card, and each read the 
letter under it. They were then told to turn to card 2 and continue. 

The two common figures were selected at random (using a table 
of random numbers) from a set of 11 figures. Any given figure ap- 
peared as a common figure twice in cards 1 through 11 and twice in 
cards 12 through 22. No figure was a common figure on both of two 
adjacent cards. The common figures were placed at random positions 
on the cards. Common figures appeared an equal number of times in 
each position on the cards of each set. The noncommon figures were 
selected from the remaining nine figures and placed randomly on the 
cards. The total number of appearances of all figures were equal for 
each set of cards. 
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Fig. 2— Sample stimulus card. 


3.3 Apparatus 


The experiment was run in the Customer Services Appraisal Labo- 
ratory of Bell Telephone Laboratories at Holmdel, New Jersey. The 
laboratory consists of five acoustically isolated rooms designed for 
psychological testing. 

The network configuration for the experiment is given in Fig. 3. 
When the experiment was run in the no-delay condition, the delay units 
were physically removed from the circuit. The network employed 4-wire 
circuits providing isolation between receive and transmit paths. This 
was necessary to prevent echo and to interface with the magnetic disc 
delay units (Echo Vox Sr.) which are one-way devices. The delay units 
were calibrated for 600-milliseconds delay and 300-milliseconds delay, 
respectively. 

Because of the need for 4-wire circuitry, the standard 500-tele- 
phone set was modified as in Fig. 4. Artificial sidetone was provided 
and loss and circuit noise were adjusted to values representative of 
standard telephone circuits. 

The conference bridge was a standard 4-wire 6-branch bridge pro- 
viding 19.5 dB net loss. Only four of the branches were used in the 
circuit — three for the participants and one for the monitor. The am- 
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Fig. 3 — Experimental configuration. 
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Fig. 4— Modification of 500 type telephone set. 
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plifiers at the bridge were adjusted to provide unity gain through the 
bridge in all directions. 


3.4 Participants 


The 36 subjects were employees of Bell Laboratories and were fe- 
male clerks with roughly the same job classification. Their ages 
ranged from 18 through 25. All had at least a high schoo] education. 
The subjects were unacquainted with the experimenter at the outset 
of the study and were volunteers recruited by members of the Ap- 
praisal Laboratory. Most of the subjects were at least vaguely ac- 
quainted with the other members of their experimental group. 


3.5 Scoring 


All measurements were made from the recorded tapes of the ses- 
sions. In all but one or two instances, the experimenter was able to 
identify the person talking from the tape. The tapes were first played 
to make an overall time measurement, to determine which participant 
spoke first, to determine which participant offered her letter first, to 
determine which of the two common figures were found first, to detect 
any errors, and to become familiar with the progress of the session. 
The tape was then replayed to obtain time-per-figure measurements. 
In addition, a questionnaire provided subjective data. 


IV. RESULTS 


Table I presents the overall time each group spent on the experi- 
mental task. The average time to complete the task in the no-delay 
condition was 13.43 minutes while in the delay condition 17.23 min- 
utes. A t-test was applied to these data to determine whether or not 
transmission delay had an effect on time to complete the task. The ¢- 
test (tio = 3.81, p < 0.01) indicated that time-on-task was signifi- 
cantly greater in the delay condition. 

Hypothesis (11) postulated there would be no difference in accuracy 
on the experimental task between the delay and the no-delay condi- 
tion. Accuracy was measured by noting every occurrence of an incor- 
rect identification of a figure made by an individual. For any given 
figure, at most two errors could be made since the person describing the 
figure always correctly identified it by reciting the letter appearing un- 
der that figure. The data on accuracy are summarized in Table I. 

It was possible for the individuals in any one group to make a total 
of 88 errors on the task. Note that the greatest number of errors 
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TABLE I — TimE-on-TASK AND ToTaL Errors MADE By INDIVIDUALS 
FoR HKacu Group 
No-Delay 


Group 
Time-on-Task (min.) Errors 





15.37 3 
14.47 0 
13.74 6 
11.97 4 
14.01 0 
11.30 0 
Average: 13.48 Total: 18 


AOoorwne 


Delay 


17.48 0 
18.58 0 
16.72 1 
15.07 | 
20.02 0 
15.58 0 
Average: 17.23 Total: 2 


eee 
Nr owowtsn 





(made by Group 3) is 7 percent of this maximum. Of a possible total 
of 528 errors among all six groups in each experimental condition, 13 
errors (2.5 percent of total possible) were made in the no-delay con- 
dition while two errors (0.4 percent of total possible) were made in 
the delay condition. A Poisson distribution test on these data indi- 
cates that the no-delay error rate is significantly (0.01 level) greater 
than the delay error rate (see Ref. 11). 

Hypothesis (ut) stated that time-per-trial would decrease with suc- 
cessive trials in both delay conditions and that for each trial, time- 
per-trial would be greater in the delay condition than in the no-delay 
condition. The time measurements per trial are an average of the 
times for identfying the two figures on each of the 22 stimulus cards. 
Fig. 5 presents the time-per-trial averaged over the six groups in each 
circuit condition plotted on a log scale. 

A 2-factor (22 trials X 2 circuit conditions) analysis of variance 
under a logarithmic transformation of the data was carried out. The 
logarithmic transformation was employed to reduce heterogeneity of 
variance and because an exponential-shaped function was found. The 
logarithms of time-per-trial and trial number fit a least squares linear 
regression model with 0.9 correlation coefficient. 

As can be seen from Fig. 5, the time-per-trial decreases on successive 
trials for both circuit conditions. The analysis of variance showed 
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Fig. 5— Average time-per-trial (log scale) vs trial number (grouped 2 trials 
per data point). 


that the difference in trial times was significant (Fe1,0: = 30.8, p < 
0.001). Fig. 5 also shows that delay times are greater than no-delay 
times. The analysis of variance confirms this difference (F1,0. = 
27.5, p < 0.001). 

The sharp rise in Fig. 5 on the third trial is the result of the initial 
appearance of one ambiguous figure which all groups found exception- 
ally difficult to locate. Note that the curves begin to decrease much 
less after the eighth trial. By this time the subjects had converged on a 
single reference phrase for each figure. The twelfth trial marks the 
beginning of a random repetition of the first 11 trials. At this point the 
subjects have seen each of the 11 figures twice and now encounter 
them for the third time. 

Hypothesis (iv) stated that no one position in the network is favored 
to emerge as a seat for a chairman in the delay condition. The hy- 
pothesis was tested using measurements of who was the first individ- 
ual to begin describing a figure for each figure on the set of stimulus 
cards. This means of judging chairmanship was chosen because, to a 
rough approximation, chairmanship is determined by leadership which 
connotes the first to act. Also, during the interview the subjects gen- 
erally stated that their criterion for judging chairmanship was based 
on who was the first person to speak up. For each group the number 
of times the subject in each position was first to describe a figure was 
noted. There were a total of 44 (the number of common figures on 
each set of stimulus cards) first responses for each group. 
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An analysis was carried out to test, on the basis of frequency of first 
responses, whether a chairman actually emerged in each of the 12 
groups. Under the null hypothesis the probability of obtaining a 
chairman was assumed to be one-third. A chi-squared test was used to 
test the significance of the difference between the observed frequency 
and hypothesized frequency of one third. Table II presents the results 
of the chi-squared test (x? with 2 df, p < 0.05) indicating the position 
of the chairman in each of the 12 groups. From Table II the chi- 
squared test indicates that no chairmanship patterns are apparent in 
either circuit. condition. 

In addition, an analysis of variance of the data on number of first 
responses according to position in the circuit was made for the delay 
condition. The test resulted in failure to reject the null hypothesis at 
the 0.05 level that all positions in the network are equally likely for 
seating a chairman (F319 = 4.77 ns). 

A questionnaire consisting of five questions administered at the con- 
clusion of the experiment provided qualitative data. To the first ques- 
tion, “Did you have any difficulty in understanding the other parties 
in the conversation?”, all subjects in both delay conditions answered 
“NO”. But in response to this question, five subjects in the delay 
condition added that members of their group often were talking at the 
same time. Only one subject in the no-delay condition complained of 
talking together. 


TaBLE IJ — InpIcATED CHAIRMAN FOR EACH EXPERIMENTAL GROUP 





No-Delay 
Group 
Chi-Squared* Questionnaire? 

1 Cc Cc 

2 A None 

3 B B 

4 B B 

5 None C 

6 A 

Delay 

7 C None 

8 Cc None 

9 None A 
10 B C 
11 B A 
12 C C 





* Chairman indicated by x?-test on first person to respond measurements. 
b Chairman indicated by majority decision rule on questionnaire responses. 
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The responses to question 2, “Did you find that a conference chair- 
man seemed to emerge during the conversation?”, are summarized in 
Table II. If at least two subjects of each group said that the same 
person was chairman, then that person is designated chairman of the 
group; otherwise no chairman is indicated. 

To question 3, “Did you notice any difference between this circuit 
and the one you normally use here at the Labs?”, 10 of the 18 people 
in the no-delay condition responded “NO”. The remaining eight sub- 
jects who answered “YES” responded that the circuit was either 
“fuzzier”, “clearer”, or “farther” than their normal circuit. In the de- 
lay condition all but four subjects responded “YES” to this question. 
They commented that the circuit was either “fuzzier”’, ‘clearer’, 
“nearer”, or “farther” than their normal circuit. Of the 18 subjects in 
the delay condition, three in Position A, four in Position B, and one in 
Position C reported having observed a faint echo. The inadvertent 
echo was due to insufficient attenuation at the conference bridge. 

To question 4, “Were you able to identify the other parties from 
their voices in the conversation?”, all subjects in both circuit condi- 
tions responded “YES”. 

To question 5, “Do you have any other comments?”, eight persons 
in each circuit condition responded “That was fun”, indicating per- 
haps that delay can be as much fun as no-delay. 


V. DISCUSSION 


As was hypothesized, the overall time to complete the experimental 
task was greater with time delay in the network. The average value 
of 17.23 minutes for the delayed circuit is 28 percent greater than the 
average value of 13.48 minutes for the nondelayed circuit. These over- 
all average time values are composed of the 22 individual trial time 
measurements. As was cited in the previous section, these trial times 
and the trial numbers, both under a logarithmic transformation, were 
found to fit a straight line with rather high correlation, thus substanti- 
ating the exponential shaped function. The existence of the exponen- 
tial function would tend to indicate an underlying model that is mul- 
tiplicative rather than additive. What is meant by this is that any 
difference in no-delay and delay times would be properly expressed as 
a ratio rather than an additive difference. During any given trial the 
control of the circuit (as evidenced by the person talking) transferred 
many times among the three subjects. It is this behavior which is 
thought to be the cause of the multiplicative relationship between de- 
lay and no-delay trial times. 


TRANSMISSION DELAY 1687 


Although no quantitative data were collected on the components of 
trial time measurements, the experimenter observed a greater redun- 
dancy in communication with the presence of delay. More time was 
spent in giving descriptions and feeding back confirming information 
than in the no-delay case. There were also more questions asked of the 
person describing the figure. Descriptions did not seem to be any longer 
— there were simply more of them. In addition to greater redundancy, 
more time was spent in pauses and attempts to speak in the delay case 
than in the no-delay case. In answer to the first question on the 
questionnaire, five times as many subjects found difficulty because of 
talking together in the delay condition than in the no-delay condition. 

The increased time-on-task in the delay condition may be thought 
of as a compensation for the greater difficulties encountered with the 
delayed network. The data on errors indicate that the increased 
amount of time-on-task experienced with the delayed circuit induced 
more reliable communication. The error rate of 2.5 percent for the no- 
delay network is significantly greater than 0.4 percent for the delayed 
network. 

As hypothesized, the time-per-trial decreased with successive trials 
for each circuit condition. These time measurements include total 
elapsed time to select a figure, describe it, entertain all questions and 
comments, agree that the common figure has been located, and finally 
read off the letters that appear under the figure on the respective stim- 
ulus cards. The decreasing time-per-trial was observed to be a result of 
decreasing length of reference phrases for the figures as well as agree- 
ment among all three participants on a single reference phrase for each 
figure. A typical sequence of phrases for the ambiguous figure appear- 
ing above the letter O on the sample stimulus card (Fig. 2) is 
“.. looks like three Vs connected in the middle with a triangle in the 
center,” “... three Vs with the triangle in the center,” “... three Vs.” 

It is clear from Fig. 5 and supported by the analysis that for each 
trial, trial times are less under the no-delay condition than the delay 
condition. Furthermore, Fig. 5 indicates that no-delay trial times 
drop sooner to a lower level than delay trial times. This graph would 
tend to indicate, as thought earlier, that learning is impaired by the 
introduction of time delay into the circuit. This, however, cannot be 
supported on the basis of data obtained in the present study. 

As hypothesized, the delayed circuit imposed no tendency for chair- 
manship patterns to develop. 

Finally, a noteworthy result of the experiment (from the question- 
naire responses) was the fact that not one of the participants reported 
having observed the presence of a time-delayed circuit. Some com- 
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plained of increased “talking together” but this led no one to guess the 
underlying reason. 

The results obtained here were based upon 4-wire circuits without 
echo sources or echo control devices. The possibility of echo and speech 
mutilation due to echo-suppressor action in commercial 2-wire circuits 
could add degradation beyond that encountered in this experiment. 
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Mutual Synchronization of Geograph- 
ically Separated Oscillators 


By A. GERSHO and B. J. KARAFIN 


(Manuscript received July 27, 1966) 


A control scheme for synchronizing the frequencies of geographically 
separated oscillators connected by communication links consists of averag- 
ing the phases received at each station from remote oscillators, comparing 
the result with the local phase, and applying the filtered error signal as a 
correction to the local oscillator frequency. The system was studied by V. E. 
Benes who found a sufficient condition for the stability of the system using 
advanced mathematical techniques. In this paper, the stability condition 
is derived (for a slightly more general control scheme) using only the transfer 
function concept of linear systems and some properties of determinants. 
A practical difficulty regarding the final frequency of the oscillators is 
discussed and a modification of the control scheme is shown to alleviate the 
difficulty. Also examined are the questions of sensitivity to parameter 
changes, the effect of jitter noise on the performance of the system, and the 
effect of failure of an oscillator or transmission link. 


I. INTRODUCTION 


Consider a network of N geographically separated stations that are 
connected by directed communication links. A local clock, or oscillator, 
is situated at each station. The problem of synchronizing the frequencies 
of the oscillators is of considerable practical interest for continental 
pulse code modulation (PCM) systems. 

The local oscillators have frequencies which may be altered in pro- 
portion to a control signal. In the absence of external control, each 
oscillator operates at a different frequency. The network is ‘“‘connected”’ 
in the sense that from any station to any other station there is either a 
direct transmission link or an indirect path via one or more intermedi- 
ate stations. A fixed time delay is associated with each transmission 
link. 

In an important but unpublished paper, V. E. Benes! has examined 
a linear control scheme in which each station receives the phases of 


1689 


1690 THE BELL SYSTEM TECHNICAL JOURNAL, DECEMBER 1966 


neighboring stations, i.e., those stations connected to it by direct trans- 
mission links. The phases are averaged and compared with the local 
phase; the error is filtered and applied as a correction to the frequency 
of the oscillator. Similar schemes were also proposed by Runyon.? 
BeneS has proved that under suitable conditions the system is stable, 
i.e., the oscillators asymptotically settle to a common frequency and the 
phase differences have finite asymptotic values. He also finds explicit 
formulae for the final frequency and asymptotic phase differences. 
To obtain these results, he resorted to the mathematical techniques of 
renewal theory and Tauberian theory. By assuming the stability of the 
system, as proved by Benes, A. J. Goldstein® has rederived the expres- 
sions for final frequency and phase differences in a more direct manner. 
Bonomi, La Marche, and Varaiyat improved the treatment of the 
stability problem and suggested some avenues of approach for the 
study of transient response. In each case, the authors relied on the 
mathematical theory of Markoff chains and stochastic matrices. 

M. Karnaugh® has formulated a more realistic and more sophisti- 
cated nonlinear control model. Broad stability conditions for this 
model are not yet known; however, certain special cases resemble the 
Benes model. 

In this paper, the stability conditions and the expression for final 
frequency for a slightly more general version of the Benes model are 
derived in a simple manner using only the transfer function concept of 
linear systems and elementary properties of determinants. This approach 
permits a clearer intuitive understanding and should be readily com- 
prehensible to the non-mathematician. The sensitivity of the system 
to parameter changes is also examined and certain questions regarding 
the final frequency of the oscillators are clarified. 

In Section II we give a formulation of the problem and obtain the 
basic equations describing the system. In Section III certain crucial 
properties of the matrix of averaging coefficients are derived which 
result from the topological constraint that the network is connected. 
Stability is proved in Section IV and an expression for the final fre- 
quency is obtained. Section V considers some practical questions with 
regard to how the final frequency is related to the free-running fre- 
quencies of the oscillators. Section VI examines the questions of sensi- 
tivity to parameter changes, the effect of failure of an oscillator or 
transmission link, and the effect of jitter noise. 


II, FORMULATION 


Let f; be the frequency of the 7th oscillator in the absence of external 
control, and r;(¢) the control signal applied to the zth oscillator at time ¢. 
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If p,(t) denotes the total cyclical phase of the 7th oscillator, then the 
actual frequency at time ¢ is given by 

Bilt) = fi + rid) (1) 
where the dot denotes the time derivative. 

The control scheme at the zth oscillator is shown in Fig. 1. The 
phases of all neighboring stations are transmitted to the 7th station. 
The transmission delay associated with the path from station j to 
station z is denoted as 7,;;. Each phase received at station 7 is compared 
with the local phase; the differences are weighted with the nonnegative 
averaging coefficients a;; and summed. The weighted sum of phase 


a; 
t-7 ut SUMMING 
Py (t-Tin) + AMPLIFIER 
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Fig. 1— Station 7 of the phase averaging control system. 


differences is applied to the filter with transfer function H,(s), and the 
filter output is the frequency correction term 7;(¢). Thus, we have 


ri(t) = hi(t) x Dy aulps(t — ti) — ps(t)], (2) 


where h,(¢) is the impulse response of the filter at the 7th station and the 
asterisk denotes convolution. 

We assume the filters have three simple properties: (¢) causality, 
i.e., the response at any instant does not depend on the future of the 
input, (22) stability, in the sense that a bounded input always produces 
a bounded output, and (72) positive de gain, 1.e., 

H,(0) =r; > 0. (3) 


Without loss in generality we may assume that the averaging coeffi- 
cients sum to unity, 1.e., 


> aij = i. (4) 
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Clearly a scaling of all the coefficients a;; for fixed 7 is equivalent to a 
change in gain factor of the 7th filter. 

If there is no direct transmission path from the jth to the 7th station, 
then the coefficient a,; is presumed to be zero. Thus, the N X N ma- 
trix, A, whose 7jth element is a;;, contains all the topological informa- 
tion about the network of communication links. In order that mutual 
synchronization be possible, it is certainly necessary that the network be 
connected so that from any station to any other station there is either 
a direct or indirect transmission path. The resulting properties of the 
averaging matrix A imposed by this topological constraint play a 
vital role in the proof of stability for the system. 

In agreement with the Benes model we consider the starting condi- 
tions where the oscillators are assumed to have been free-running 
for an indefinitely long time prior to = 0, and at ¢ = 0 the control 
signals r; are connected to the oscillators. Thus, we have 


pit) = ft +p(0), %t<0 (5) 


where p;(0) is the phase at ¢ = 0, and from (1), (2), and (4), the fre- 
quency of the zth oscillator when the controls are operating is 


Bilt) = hilt) « Di ailp(t — rj) —pi(Ql +f, t20 (6) 
7 
Equations (5) and (6) for 7 = 1, 2, ---, N completely characterize 


the behavior of the system under the particular starting conditions of 
interest. Taking the ordinary Laplace transform of (6), we obtain 


sP; = Hy 3 dijP) — HiPs + =f: + pi(0) aes (7) 


where 


Giz = Qi exp (—S8r4)), 
0 
Q:(s) = Hi(s) Do a; | p;(t) exp (—st) dt, 
7 Tis 


and P,(s) is the Laplace transform of p;(t). The term Q,(s) is the con- 
tribution to the zth oscillator frequency after t = 0 due to the contents 
of the transmission links at ¢ = 0. Using (5), Q:(s) can be evaluated 
explicitly, but for our purposes it is sufficient to note that 


sQ,(s) ~0 as s—>0. (8) 


The transformed equations (7) can, in principle, be solved for the 
phases p,(t) for ¢ 2 0. The desired stability information can be obtained 
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directly from these equations. We shall, however, obtain this informa- 
tion in a somewhat indirect but more profitable way by defining an 
associated linear time-invariant system with N inputs and N outputs. 

Consider the same control arrangement for the N interconnected 
oscillators. Instead of the former starting conditions, suppose the con- 
trol paths have always been connected and that each oscillator can be 
activated by an arbitrary frequency “input” as shown in Fig. 2. Then 
the actual frequency of the 7th oscillator at time ¢ is the sum of the 
basic frequency input v,(t) and the correction component 7,;(¢) leaving 
the filter. The phases p,(f) are considered the “outputs” of the linear 
system. When »,(¢) = 0 for each 7, the system is at rest and all outputs 
p(t) are identically zero. 

The importance of the associated linear system is that any desired 
starting conditions in the physical model can be treated by an equiva- 
lent set of inputs to the linear system. To clarify this, note that the 
system of Fig. 2 is characterized by the equations 


pi(t) = hj(t) « 2d ailp;(t — 733) — pi(t)] + v(t), -2 <t< ~, (9) 


Formally taking the exponential (two-sided Laplace) transform of (9) 
we obtain 


sP; = H; >) Gi; P} — HiP: + Vi, (10) 
7 


where P;(s) and V;(s) are, respectively, the exponential transforms of 
p;(t) and »v,(t). Equation (10) implicitly characterize the associated 
linear system whose inputs are v,(¢) and outputs p,() as long as v,(t) 
has an exponential transform. Comparing (7) and (10) we see that the 
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Fig. 2— Model of station 7 of the associated linear system. 
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phase responses of the physical model with the corresponding starting 
conditions will be the outputs of the associated linear system for ¢ 2 0 
if we select the inputs to be 


1 
Vils) = sii + pi(0) + Q;(s). (11) 
In the time domain, these inputs are 
vi(t) = f(t) + p:(0)6@) + g(t), (12) 
where q;(t) is the response of the filter H;(s) to a time-limited input 
which begins at time f = —max 7,; and ending at ¢ = 0, 6(¢) is the unit 


impulse function and u(t) is the unit step function. From (8) and the 
final value theorem it follows that q;(f) — 0 as ¢— o. It is important 
to note that the phase responses to the inputs (12) will be the same 
as the phase responses of the physical model only for ¢ = 0. Fort < 0 
the responses of the associated linear system do not correspond to the 
physical model. 

Equation (10) may be expressed in the form 


P; = Gils) 3 dP, +(, a) (13) 
where 
= H;(s) 
Bi(s) = s + Hy(s) : (14) 


The simplified model of the linear system, corresponding to (18), is 
shown in Fig. 3 where 6,(s) is the transfer function of the feedback 
configuration as shown. Thus, the operation of the 7th station is to 
average the incoming phases, apply the average to the filter £6,(s), 


ait ae yy ees 
P, (t-Ti) 1 | 


Po(t-Ti2) 








Pylt-Tn) Bills) = Hi/(stHi) 


{ 


Fig. 3.— Simplified model of station ¢ for the associated linear system. 
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and finally add a filtered input component to produce the phase response 
p.(t). We shall see in Section IV that the condition for stability of the 
system is simply that the filter 6;(s) have a gain less than unity for 
sinusoidal inputs. 

Equations (13) for 7 = 1, 2, --- , N can be formally solved for the 
phase responses with the help of matrix notation. Let B(s) be the 
N X N matrix whose 77th component is 


bij(s) = Bi(s) Gi; 
and let C(s) be the diagonal N X N matrix whose 77th element is 


= 
s+ Hi(s)° 


Note that for s = 0 we have ¢,,(0) = 1/\; and B(O) = A, where d; 
is the de gain of H,(s) and A is the averaging matrix, both defined 
earlier. Let P(s) and V(s) be the N component column matrices whose 
ath elements are, respectively, P;(s) and V,(s). Then (13) becomes 


ci (s) => 


[i —- B}) P = CV (15) 
Se 
P= KV, (16) 
where 
K(s) = WU — B(s)* C(s) (17) 


is the matrix transfer function of the linear system. Thus, each element 
K;;(s) of K(s) is the scalar transfer function relating the output p,(¢) 
to the input v;(¢) when all other inputs are zero. In Section IV we shall 
determine certain key properties of the singularities of K;;(s). In order 
to examine the behavior of J — B(s) in the neighborhood of s = 0, 
certain important properties of the averaging matrix A will be needed. 
In the next section these properties are derived. 


III. PROPERTIES OF THE AVERAGING MATRIX 


As a result of (4), the averaging matrix A has row sums equal to 
one. From the requirement that the network be connected, certain 
restrictions are placed on which combinations of elements of A may be 
zero. These two characteristics of A imply certain essential properties 
of the matrix J — A where J is the identity matrix. 
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Theorem 1: If A ts the averaging matrix of a connected network of N sta- 
tions then the matrix I — A has rank N — 1. 


Proof: Since the equation 
(I —A)x =0 (18) 


is satisfied by any vector x with all components equal, the matrix J — A 
is singular and must, therefore, have rank less than N. Suppose that its 
rank is less than NV — 1. Then (18) has at least two nontrivial solutions 
that are linearly independent. Therefore, there exists a nontrivial solu- 
tion, u, whose components are not all equal. Now let w be the solution 
vector with each component equal to the negative of the smallest 
component of u. Then y = u + w is a nontrivial solution of (18) with 
all components non-negative and at least one component equal to 
zero. Let @ = {t1, %2, °-*, te} be the set of indices for which y; = 0 
and ® = {tra, tep2, °°° , tv} the set of indices for which y; is positive. 
Since y satisfies (18) we have 


| 
co) 


N 
Yi — Dy iY; = t= 1,2,---,N 
j=1 


and so 
>> aisy; = 0, forzie@. 
728 
But this is only possible if 
ai; = 0, z7eQ@ and jeB, 


which implies that there is no transmission path from any station with 
identifying index in @ to any station with identifying index in @. Con- 
sequently, the network of N stations is not connected, which is a con- 
tradiction. Therefore, J — A must have rank N — 1. 


Theorem 2: If A is the averaging matrix of a connected network, the co- 
factors of all the elements in any given row of I — A are equal and posi- 
tive. Specifically, if M;; is the cofactor of the 1jth element of I — A, then 


M;= Mx>O0 
fori,j,k =1,2,--- ,N. 
Proof: Since J — A has rank N — 1, the solutions of 
(I — A)jy = 0 
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satisfy” 


yi Mi; . 
=— = —— = 1,2,---,N. 
Yk M3,’ t m) ? 


But the only solutions y are those with all components equal. There- 
fore, 


M;; = My all 4,7,k. (19) 


Let R(e) = I — &A and let A/;;(e) be the cofactor of the 7jth element 
of R(e). For 0 S e < 1, each principal minor of R(e) is the determinant 
of a diagonally dominated matrix (see Appendix), so that 


Mile) #0 OSe <1. 


Since M;,(0) = 1, it follows by continuity that J7,,(1) 2 0. Hence, 
from (19) 


M;; = M x =O all 4,45Ks (20) 


Now (I — A)’, where the prime denotes the transpose, must also 
have rank N — 1. Thus, solutions of 


(I — A)’z=0 (21) 
satisfy 
SPM. Fg Dice: 
oa. © 1,2,---,N. (22) 


Equations (20) and (22) imply that the nonzero components of z must 
have the same sign. Suppose a solution z of (21) has at least one com- 
ponent zero and nonzero components positive. Then the same argument 
used in Theorem 1 leads to the conclusion that the network is discon- 
nected, which is a contradiction. Therefore, there is a solution z with all 
components positive and consequently (22) implies that all cofactors 
M;; are positive, which completes the proof. 


IV. ANALYSIS 


With the help of the preceding results, we are now in a position to 
prove stability and determine the expression for final frequency. These 
results will be obtained under the assumption that 8;(s), for each 27, 
satisfies the condition 


| Bi(jw)| < 1, w ~ 0. (23) 


1698 THE BELL SYSTEM TECHNICAL JOURNAL, DECEMBER 1966 


In Appendix B we show, with the help of the Nyquist criterion, that 
condition (23) implies the stronger condition 


|8(s)| <1 for s in &@, (24) 


where @ is the right half and imaginary axis of the s plane excluding the 
point s = 0. 

In Section II we saw that the associated linear system is characterized 
by the matrix transfer function K(s) given by 


K(s) = [J — B(s)} C(s). (25) 
Now, since 6; = H,/(s + H,), it follows from (24) that 
c(s) = 1/(s + H,) 


has no singularities in ®. Furthermore, under condition (24) the matrix 
I — B(s) is diagonally dominated (see Appendix A) for all s in @. 
Thus, the determinant | 7 — B(s)| is nonzero for all s in ®, and so we 
conclude that each component transfer function K,;(s) is analytic in 
the region @. 

At s = 0, the matrix J — B(s) reduces to J — A which is singular 
according to Theorem 1. Thus, the determinant |J — B(s)| has a 
zero at s = 0. To show that it is only a simple zero we find an asymptotic 
expression* for the determinant in the neighborhood of s = 0. In the 
matrix J — B(s), we replace the elements b,;(s) by their asymptotic 


expressions 
bij(s) ~ ai; E — (+5 + :) s| , sO 


where we have used the relations exp (—sr) ~ 1 — sz and H,/(s + H,;) 
~ 1 — s/\;. Without changing the value of the determinant, we may 
replace the first column by the sum of all the columns. The first column 


then becomes 
I i} 1 
(nt ihe(ntz)oe(n tz) 


N 
Tj = DS QijT a3 (26) 
j=1 


where 


is an average of the transmission delays of links arriving at the zth 


* The technique for finding the asymptotic expression is due to A. J. Goldstein. 
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station. Now we expand the determinant about the first column and 
obtain 


[1 BO) |~ eo (tt) Ma, s— 0, (27) 


where M,; is the cofactor of the 7th element of J — B(Q), as defined 
in Section IIT. 

Since My, is positive (from Theorem 2), it follows from (27) that 
| 7 — B(s)| has only a simple zero at s = 0. But ¢;;(0) = 1/), is finite, 
so that from (25) we conclude that each K,;(s) has a simple pole at 
s = 0. Using (25) and the asymptotic expression (27) it follows that 


Ki;(s) ~ ¥;/8, s—0, (28) 
where 
M ji); 
p> (« at *) Mu 
i dt 


Note that y; is positive and independent of 7 because M;; = Mj > 0, 
according to Theorem 2. Thus, letting 


Y= 





id Ma/dj_ 
ae > Mu/d (29) 
l 
we have 
Pere p> rid (30) 


with 0 <d; < land >\d; = 1. 

We have, therefore, shown that each transfer function K;;(s) is 
analytic in the right half and on the imaginary axis of the s plane ex- 
cept at s = 0 where it has a simple pole with positive residue inde- 
pendent of 7. The impulse response k;,;(¢), associated with K,,(s), will, 
therefore, consist of exponentially decaying sinusoids and a step function 
of height y;. 

To determine the stability of the original model under the particular 
starting conditions, we examine the asymptotic behavior of the phase 
responses of the associated linear system when subjected to the inputs 
given by (11). From (16), we have | 


Pi(s) = 2 Ki;(s) E =e p;(0) r ay(s) | . (31) 
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From the known properties of K,;(s), it follows that the phase response 
p(t) for £ > 0 will be the sum of terms decaying exponentially to zero 
plus a term of the form ft + 7; where f and 7; are obtained by the residue 
theorem according to 


f = lim 8?P,(s) 
s>0 
and 
<4 OE e8 
“n= lim 7s [s°P;(s)]. 
The final frequency f of the 7th oscillator is, therefore, given by 
N 
f= Qa vfi (32) 
i= 


which is independent of 7. Thus, we have proved stability of the system 
since the frequency of each oscillator has been shown to asymptotically 
approach the common frequency f and the phase differences clearly 
approach finite values. From (80) the expression for the final frequency 
can be written as 


> df; 


cs I =F » Tididi 


(33) 


which, with the help of (29), shows the dependence of f on the delays 
T,; and the de gains ), . 


V. REMARKS ON THE FINAL FREQUENCY 


From the results of the preceding section it is clear that the final 
frequency can be below even the lowest oscillator free-running fre- 
quency. In fact, it is evident from (383) that the final frequency is a 
monotonically decreasing frunction of the system gain-delay products. 
Thus, the controls may bring the system to a frequency outside its 
practical operating range. 

The final frequency reduction is a consequence of the fact that the 
frequency control of each station varies directly with the differences of 
total phase. The interstation delays introduce phase lags which drive 
down the frequency of each station. This point is made somewhat 
clearer by considering a system in which all the oscillators have the 
same frequency f and the same initial phase. When the controls are 
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applied at t = 0 an average phase “error” (—fr,) is applied to the con- 
trol path of each oscillator 7. This “error”? causes a simultaneous re- 
duction in the oscillator frequencies from which the system never 
completely recovers. 

As a conceptual solution to this difficulty, suppose the system of Fig. 
1 is modified so that the local phase at the output of the integrator is 
passed through a delay line before being compared with the incoming 
phases. This local delay at station 7 is chosen equal to 7;, the average 
delay of links terminating at station 7, as defined in (26). 

In the previous model, the error signal was determined by a com- 
parison of the local phase at the present time with the remote phases 
of earlier times. In this modified system, however, the comparison is 
made between phases which on the average occur at the same time. 
Thus, the undesired component of the error signal due to interstation 
delays is eliminated. 

Using an argument which parallels the development of Sections II, 
III, and IV, it may readily be shown that the final frequency for the 
modified system is given by 


f= Lai. (34) 


In contrast with the original system, it is evident that the final fre- 
quency of the modified system is always an average of the free-running 
frequencies. 

The Bene’ formulation (Fig. 1) may be viewed as a simplified ab- 
straction of the more complex practical systems that have been pro- 
posed.?»> Both the Benes formulation and the modified system contain 
a total phase comparator which is an impractical element. Karnaugh® 
has shown that an important linear subclass of the more realistic class of 
systems he has proposed obey equations of the same form (6) as in the 
Bene’ model. This more realistic formulation also fits the linear system 
model with modified frequency “inputs” that depend in a different 
manner on the initial conditions. It is, therefore, subject to the stability 
condition (23). 

Moreover, it has a different final frequency which approaches an 
average of the free-running frequencies as the interstation delays become 
large. 

In short, the BeneS formulation was sufficient to provide the im- 
portant stability criterion, but neglected factors affecting the final 
frequency. The linear system model developed here is general enough 
to be applicable to both the Benes formulation and a linear subclass of 
the more realistic Karnaugh formulation. 
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VI. SENSITIVITY AND RELATED QUESTIONS 


Suppose that the system has been operating in a synchronized steady- 
state condition for a long time, and at some instant, say ¢ = t) , asudden 
change is made in one or more parameters of the system. The subse- 
quent phase responses are determined by considering the new associated 
linear system subjected to suitable inputs equivalent to the pertinent 
starting conditions. These inputs will have exactly the same form 
as (11) but now the term Q,(s) will be evaluated using the past history 
of the phases given by 


pi(t) = fet + 0: t< lo, 


where f, is the synchronous frequency prior to the parameter change. 
If the stability condition (23) is satisfied for the new system and if the 
parameter change does not reduce any X,; or a,; to zero, the new system 
will also be stable. Consequently, after t = t the frequencies of the 
oscillators will asymptotically resynchronize to the new final frequency 
determined by (33) or (34) using the changed parameter values. ‘rom 
these arguments we can also deduce that the effect of a slowly time- 
varying parameter on the system operation is to cause a corresponding 
slowly varying synchronous frequency. By “slow” time variations we 
mean that the time for a noticeable change in a parameter value to 
occur is much longer than the time constants associated with the tran- 
sient response of each K;;(s). 

By similar arguments, it is easily seen that failure of a transmission 
link will lead to resynchronization if the remaining network is still 
connected. Also, in the case of oscillator failure, the remaining N — 1 
oscillators will resynchronize to a new frequency if the resulting network 
of N — 1 stations is still connected after removal of all transmission 
links entering or leaving the inoperative station. In each case, the final 
frequency can be computed from (83) or (84) using the appropriate 
parameter values. To prove these results, the nonzero averaging weights 
zan be rescaled so that A has row sums unity; the filter gains A; are 
assumed to be correspondingly rescaled. The characterizing equations 
for the new system then has the required form and so resynchronization 
will occur. 

The effect of independent jitter noise on the frequency of each oscil- 
lator may be considered by including a noise term 7,(é) in each ‘‘input”’ 
v(t). By superposition, the effect of noise can be considered separately. 
Thus, each phase response p:(é) will consist of the response in the 
absence of noise plus a noise component whose power density spectrum 
is 
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N 


> | Kio) [? S,(w), 

where S,(w) is the power density spectrum of n,(é). Consequently, if 
the input noise jitter has zero mean and finite variance the output 
noise components will also have zero mean and finite variance. We 
conclude, therefore, that in the presence of noise jitter each oscillator 
will asymptotically have a common frequency with a random perturba- 
tion. The perturbations will be correlated but, in general, will not be 
identical. Furthermore, small jitter noise implies proportionately small 
perturbations. 


VII. CONCLUDING REMARKS 


We have seen that the transfer function approach has permitted a 
simple treatment of a rather complicated control system. Further 
studies regarding transient response or bounds on the size of perturba- 
tions due to jitter noise can be made for particular topological configura- 
tions by determining more information about the transfer functions 
K,;(s) with the help of (25). The linear system approach together with 
the added generality of having different filters at each station has 
made it possible to consider the effect of parameter changes or oscillator 
failure on the behavior of the system. 
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APPENDIX A 


A square matrix A is said to be diagonally dominated when for each 
row the sum of the magnitudes of the off-diagonal elements is less than 
the magnitude of the diagonal element, i.e., 


las | > do Jas], each 7. 
TAA 


Theorem: If A is diagonally dominated it is nonsingular. 


Proof: Suppose the contrary. Then there exists a nontrivial solution 
{2;} satisfying 


» a2; = 0 each 72, 
: 
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Let 7 be one of the indices for which | x; | is a maximum. Then 


Arrly = a ArkLE 
kr 


so that 


Jan {la { Sd fan} |e] S De laal [a | 
; k¥r kr 
which is a contradiction. Hence, the theorem is proved. 


APPENDIX B 
On the Boundedness of 8;(s) 


Theorem: If 8;(s) ts bounded by unity on the jw axis then it is bounded 
by unity in the entire right-half plane. 


Proof: Since 


H;(s)/s 
1+ H;(s)/s’ 


the condition | 8;(jw)| < 1 is equivalent to 
[A | < |1+ A exp (i)|, (35) 


Bils) = 


where 
A exp (ig) = Hi(Gw)/jo. 
But (35) is equivalent to 
A cosy > —3 


so that the locus of H(jw)/jw, as w increases from — © to ~, cannot 
encircle the point —1. Hence, by the Nyquist stability criterion, 6;(s) 
is analytic in the right-half plane. Furthermore, 8;(«©) = 0. Thus, 
it follows that | 6:(s)| < 1 in the right-half plane according to the 
maximum modulus theorem. 
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A Model for the Organic Synchronization 
of Communications Systems 


By M. KARNAUGH 


(Manuscript received August 16, 1966) 


Organic synchronization 1s a method for the mutual synchronization of a 
set of geographically separated clocks. It 1s applicable to pulse code modula- 
tion (PCM) communications networks and to other systems which have 
similar requirements for synchronism. 

After a brief review and history of the problem, a model for organic 
synchronization 1s developed. A control-independent study of possible 
equilibrium solutions is then carried out. A special class of controls is shown 
to provide asymptotic stability in the limiting case of zero delays. This 
result leads heuristically to the synthesis of a broad class of nonlinear con- 
trols. With these controls, the systems are represented by families of non- 
linear differential-functional equations. This model provides a basis for 
the simulation of organic synchronization. Broad conditions which are 
mathematically sufficient for the stability of the nonlinear systems are not 
yet known. The final frequencies of a linear subclass of organic systems, 
known to be stable, are examined. 


I. INTRODUCTION 


The timing of the switching actions at each switching center of a pulse 
code modulated (PCM) communications system is governed by a device 
called the ‘local clock.” It may consist of a cyclic counter driven by an 
oscillator. Each cycle of the counter is then one clock cycle. 

In a geographically widespread PCM system, the local clocks may be 
either autonomous or synchronized. This choice should be made with 
the best possible knowledge of the available technology, as well as con- 
sideration of its functional and economic consequences. The choice is 
clearly a rather basic one, and it may have long term effects upon the 
evolution of the system. 

The time-multiplexed PCM signals arriving at any locality may have 
arbitrary, and usually scattered, points of origin. Some of them require 
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decoding into a common analog form. In particular, they may be voice 
signals. A homogeneous, time-multiplexed set of such signals is easily 
decoded by a common digital-to-analog converter, provided that the 
transmitted samples have been generated synchronously. A nonsyn- 
chronous alternative is to insert extra digits into the signals in order to 
permit multiplex transmission. Additional equipment is needed to re- 
move these digits and smooth the timing of the demultiplexed samples 
before or after decoding them. 

This paper is only one of a number of studies of system synchroniza- 
tion, and it does not provide a complete solution to the problems touched 
upon. After a very brief review of some past work in this field, I shall go 
back to fundamentals to derive a model for organic synchronization. 
Following this, the sections entitled ‘Equilibrium Points’’, “Reduced 
System Equations’, ‘Controls: Qualitative Discussion’, and “System 
with Zero Delays” provide background for the synthesis of a family of 
controls which is introduced under the heading, “‘A Family of Realizable 
Organic Systems’’. 

The question of the final frequencies of certain linear organic systems 
is then taken up. Finally, some remarks are made to clarify the stability 
problem. 


II. HISTORY 


The synchronization of PCM networks has long been a subject of 
interest. The question of synchronizing switching centers, in addition to 
the transmission links, arose in 1956, when the PCM telephone switch- 
ing experiment, later named Essex,! was planned. 

The term “organic synchronization’, which seems to have been in- 
troduced by V. E. Bene§,? will be used herein for systems fitting the 
model to be derived in later sections. The systems treated by Benes, 
excepting a certain minor idealization, form a subclass of these systems. 
This same subclass of systems is discussed in a patent? by J. P. Runyon. 

Benes? has demonstrated asymptotic stability for his systems, which 
are linear, under quite interesting conditions. He has also given formulas 
for the asymptotic system frequency and for the asymptotic relative 
phases of the oscillators. A. J. Goldstein* has given simplified derivations 
of these formulas. 

An alternative mutual synchronization method, called ‘frequency 
averaging’, has been treated by BeneS and Goldstein.® Frequency 
averaging systems, while stable, lack a frequency determining element. 
Each oscillator puts out the average of the frequencies received from its 
neighbors, and the system frequency will wander in the presence of 
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noise. Because of this feature, it does not seem to be very practical, 
unless it is combined with other techniques. 

The transmission of a synchronizing signal from a master oscillator 
to all other oscillators, which are locked to this signal, is perhaps the 
simplest approach to synchronization. However, such a system is vul- 
nerable to failure of the master oscillator or failure of a transmission 
link. Means for mitigating this weakness have been proposed by G. P. 
Darwin and R. C. Prim.* They equip the system with automatic means 
to reorganize itself in the event of a failure. Unfortunately, this adds 
considerable complexity to the basically simple method. 

Further discussion will be limited to organic systems for synchroniza- 
tion. 

B. J. Karafin’ has carried out some digital computer simulations of 
organic synchronization of small systems. A. Gersho and B. J. Karafin’ 
have simplified the proof of asymptotic stability for Benes’ systems. 
C. J. Candy and M. Karnaugh? have studied organic systems of up to 
four switching centers by means of an analog simulator. M. B. Brilliant 
has also studied linear organic systems!® and has computed transient 
responses of certain large linear systems.!! 

Linear systems with zero delays have also been studied at the Univer- 
sity of Tokyo by T. Saito, H. Fujisaki and H. Inose.” 


III. THE SYNCHRONIZED NETWORK 


Consider a set of N = 2 geographically separated pulse code switch- 
ing centers, interconnected by directed pulse transmission links, as 
illustrated in Fig. 1 for the case N = 4. 

All possible links need not be physically provided. The cases of great- 
est interest are those in which there is a directed path from any center 


LINK (2,1) 





CENTER 


Fig. 1— A sample network. 
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to any other center, possibly by way of some intermediate centers. 
Systems with this property will be called ‘‘connected systems’’. 

A connected system of N centers must have at least N links, because 
at least one link must terminate at each center. When the centers are 
connected in a unidirectional loop, there are exactly N links. The maxi- 
mum possible number of links, assuming no duplications, is realized 
when every ordered pair (2,7) of distinct centers is connected by a link 
to 2 from 7. This number is N(N — 1). The correspondence between the 
ordered pair (7,7) and the direction to 7 from 7 is a convention which will 
be followed consistently. 

An important component at each center is the local clock which de- 
termines the timing of all switching actions at that center. The messages 
from all other centers arrive in the form of framed pulse codes. These are 
pulse codes divided into sequences containing equal numbers of digits 
by means of periodically introduced framing digits. In order for the pulse 
codes to be correctly processed, a correct phase relationship must exist 
between the arriving framed code and the local clock. 

The desired phase relations are realized by providing a certain amount 
of buffer storage for each incoming link. Such equipment is illustrated 
in Fig. 2. The arriving digits are stored in a cyclically addressed discrete 
memory. They are read out of the memory under control of the local 
clock and of a circuit which monitors the appearances of the framing 
digits, so as to be correctly phased. 


INPUT 
REGISTERS 


RECEIVED 
SIGNAL 











TIMING 
EXTRACTOR 

























WRITE FRAMING ee 
COUNTER CONTROL CAL 
SELECTOR SELECTOR] | SYSTEM | CLOCK 








OUTPUT 
EGISTERS 


Fig. 2— Buffer memory. 
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SIGNAL 


ORGANIC SYNCHRONIZATION 1709 


Other forms of buffer memory which incorporate variable delay lines 
have also been proposed. These may be acceptable and less costly in 
some cases. 

Under favorable conditions, the arriving signals can be correctly 
phased by means of the buffers. However, unless the oscillator frequencies 
are properly controlled, their phase differences will wander beyond any 
bound. Then, some of the buffer stores will fill up or be emptied, causing 
erroneous codes to appear at their outputs. It is the primary object of 
the controls to avoid such malfunctions. The system will be considered 
to be operating synchronously when no information is being lost in this 
fashion. 

I shall adopt the point of view that it is desired to keep the buffer 
memories just half full, in which case the system would not be unduly 
vulnerable to transient disturbances. Controls will be sought which tend 
to inhibit large deviations from the desired condition. We shall see that 
these deviations cannot, in general, be reduced to zero. The oscillator 
control signals will be derived from them. 

It should be noted that the transmission delays between centers are 
variable over some small fractions of their center values. These delays 
will depend upon the environmental conditions of the propagating media 
and on message-induced jitter at pulse repeaters.!4 The buffer memories 
must mop up the delay variations as well as the effects of phase wander 
of the oscillators. 


IV. NOTATION 


The single subscripts 2, 7, k, ---, refer to the various centers and to 
the oscillators located at these centers. Their range is the integers, 1, 2, 
--+, MN. When one of them appears in a statement or equation with no 
other qualification, the statement or equation holds over the whole 
range. 

It has already been pointed out that the ordered pair (7,2) designates 
the link to center j from center 7. When a statement or equation con- 
tains a pair of subscripts with no other qualification, it holds for all 
pairs (7,7) which designate existing links. 

The set of all existing links will be called R. Thus, (7,2) ¢ R means 
there is a link to 7 from 7 in the system. 

Similarly, R; is the set of lnks terminating (i.e., receiving) at center 
i, and S; is the set of links originating (i.e., sending) at center 7. Thus, 
N 


R, = US = U {(j,0)}. 


7=1 


R= 


ioe 


°. 
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Let M be the number of links in the system. We have seen that 
NSESMSENUW -1). 


System controls will be supposed in effect for ¢ 2 0. Prior history of 
the system provides the initial condition. Statements about functions of 
t with no other qualification will hold for t = 0. 

Occasionally, vectors will be used having N singly subscripted com- 
ponents or else 1 doubly subscripted components. For example, the 
delay vector + has the M components 7;;, (j,i) ¢ R. It will be clear 
which vector space is meant in each case. 


V. PHASE, FREQUENCY, AND DELAY 


The local clocks will emit coherent signals. That is, for time intervals 
which are very long compared to one period of the clock, the output will 
be approximately periodic. Under these conditions, many formally 
different definitions of instantaneous frequency will be in good numerical 
agreement. I shall simply postulate the existence of such continuous 
functions, f;(t). 

Phases of the oscillators are defined to be 


pil) =a Ore i Ore (1) 


in cycles, and 
fi = pi. (2) 
The principal value of the phase is 
¢; = p; modulo 1 (3) 


and has the range 0 S g; <1. 
The initial condition for the phases will be 


pi(0) = (0). (4) 


The values, ¢;(0), are observables of the physical system. In fact, the 
switching actions at center 2 are timed according to ¢;(t). 

If there is a transmission link to center 7 from center 2, the signals 
transmitted therein will be subject to a time delay 7;;(¢). If a pulse is 
launched from center 7 at a time ¢, and received at center j at time fe , 
then the delay is defined to be 


7 ji (te) = te = ty . (5) 
Analogously, the phase of the signal received at time ¢ by center j 
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from center 7 is defined to be 


pii(t) = pit — 75: (t)]. (6) 


It should be recognized that the principal phase of a timing wave 
recovered from the received pulse code would, in practice, only approxi- 
mate ¢;;(t). Errors of a few percent of one pulse period are common in 
pulse repeaters.” This corresponds to a fraction of one percent of a typi- 
cal frame period. 

The frequency of the received signal is obtained by differentiation of 


(6). 
fat) = [1 — ry OI) -Sidt — 75:0). (7) 


This equation displays the Doppler shift of frequency caused by varia- 
tion of Tits 

The clock frequency at the ith center may be represented in the 
idealized form 


fi) = PF + 2; + git) + mt). (8) 


F is the mean center frequency of all clocks in the system, averaged 
over the time during which the system is observed. H; is the incremental 
center frequency of the 7th clock, also time averaged. By definition, 


N 
> LE; = 0. (9) 
i=1 


The time function g;(¢) is the contribution of the system controls, 
while 7; (¢) is a random noise with zero mean. There will be a symmetrical 
bound on g;(¢), 


loi) | S G, (10) 
which is supposed to be larger than the other frequency deviations. This 


is necessary if the controls are to bring all oscillators to the same average 
frequency. Under realistic conditions of operation, 


t+1/F 
G > max | #;| + max o (r/ n(s)ds) (11) 
i. a t 
t>0 


where o(-) is the standard deviation. 


VI. FUNCTION OF THE BUFFER MEMORY 


The principal phase of the signal arriving at center 7 from center 7 
will usually not agree with that of the oscillator at 7. The purpose of 
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the buffer memory in this link is to delay the signal by an additional 
time, d;;(t), so that 


ej(t) = gst — dj: ()). (12) 
In view of (6), this can be written 
eit) = gilts), (13) 
where 
ty = tb — dy lt) — ryt — dj). (14) 


Taking the right-hand derivative of (13), and because this derivative 
of the principal phase is always equal to the derivative of the phase, we 
get 


fit) = (1 — diy OV — rile — dik O} Filta), (15) 
where physical considerations make it clear that 
[aj],  |re| «1. 
Equation (15) implies the dependence of d;; on fi, f;, and 7j;. 
Matters can be simplified by shifting attention from the delays in the 


buffer memories to the numbers of frames, i.e., clock cycles, they con- 
tain. The number of cycles in the (7,7) buffer at time ¢ is: 


yt) = put) — palt — dj (t)) (16) 
and 
yi (t) = f(t) — TL — di’ O)-falt — du). 
Using (7) and (15), this can be put in the form 
yi (t) = fet) — F5), (17) 


which equates the rate of accumulation of cycles to the rate of arrival 
minus the rate of removal. In terms of the oscillator frequencies, 


y(t) = 1 — ry OI) -Sdt — 74:0] — 50). (18) 


Suppose the (7,7) buffer has a capacity of 2D; cycles, The normalized 
state of this buffer is defined to be 


ay(t) = Du? -[yu(t) — Dial, (19) 


which is the fractional deviation of its contents from half its capacity. 
In terms of this variable, (18) becomes 


vj (t) = Dy? [L — tH Ol Flt — tH) — Div -F;Q. (20) 
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This equation explicitly relates the derivative of the buffer memory 
state variable, x(¢), to the delays and clock frequencies in the system. 

I have remarked that the frequencies should be controlled so as to 
prevent any buffer memory from emptying or filling. For example, if 
buffer (j,z) is nearly empty, then we desire f; > f; until the situation is 
sufficiently corrected. On the other hand, if the (j,7) buffer is nearly 
full, then the inequality is reversed. However, things are complicated by 
the fact that all buffers associated with the zth center, that is, those in 
links of the set R; U S;, are affected by a change in f;. 

The system is said to malfunction whenever 
max | 2;,(¢) | > 1, 

yeR 


(7,7) € 
that is, whenever the buffer memory state vector leaves the ‘‘unit cube’’. 
Defining the nonnegative scalar, 


Tel x) = max: (aye; (21) 
(7,t)eR 


we see that the system is in a permitted buffer state when 


P(x) =. 1. (22) 


VII. EQUILIBRIUM POINTS 


Suppose the system is so controlled that an equilibrium solution to 
the system equations is possible. That is, in the absence of disturbances, 


x(t) =0 for £20 
and there exists a constant f such that 


fi@) =f for 20, ¢=1,2,---,N. 


If the system, in or near this state, is disturbed by a change in the net- 
work configuration, noise in the oscillators, or changes in some of the 
delays, then variations in the state of the buffer memories will result. 
To minimize the chance of malfunction under such disturbances, it seems 
reasonable to seek an equilibrium in which the buffer state is, in some 
sense, near x = Q. That is, the buffers are nearly half full. 

I shall begin along these lines by seeking the set of equilibrium points 
which can be reached from arbitrary initial conditions and under any 
controls whatever. The situation of asymptotic equilibrium to be con- 
sidered is as follows. 


(t) »@) = 0 for 


t 20; 
(it) ry(t) = r3:(0) > 


> 
0 for t= 0, (j,7) €R. 
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Ga linehe) Sh fee FSO ee 
t>0 

(iv) lim [p:(@¢) — p@)) = 4% for += 1,2,---,N, 
too 


where 


p(t) = wy oy Balt), ee 


Let x = lim x(t). The locus of attainable points x will be examined 


t>0 
under the above conditions. 
With the delay vector + assumed to be constant, (20) has the sim- 
ple form, 


ay = Dy -filt — 74) — Dis f(t). 


Therefore, 


= (0) + “aes iat 


8 
| 


ll 


x0) + Dyi* [ ae) =a ies 


The integral may be evaluated, using (1) and the conditions (¢77) and 
(iv) of asymptotic stability. The result is, 


DiGi — 94 — Tif) + Bi, (23) 


Uji 
where 
By = 20) + Dj (p30) — pi(—ri)] (24) 


is a constant which depends upon the initial condition. Equations (23) 
express the set of buffer memory equilibrium states attainable from a 
given initial condition in terms of the asymptotic phase differences and 
the asymptotic system frequency. This set depends upon the initial 
condition through the parameters, B;; 

It is shown in Appendix A that the set of phase differences 


{(G — Gs) | Gt) € R} 


contains exactly (NV — 1) linearly independent elements. There are M 
components of the asymptotic buffer state vector x, with M = N. 
Therefore, if f were an unconstrained real variable, we see that, as a 
function of the phase differences and f, x would range over an N-di- 
mensional linear manifold of its 1/-dimensional space. 
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However, the system frequency f must be near /’, the average center 
frequency of the clocks. More specifically, 


F+maxh;-G<f<F+mn#;+G. 


This inequality requires that the bound on the magnitude of the fre- 
quency control be large enough to reduce the highest frequency below 
the system frequency and to raise the lowest frequency above the system 
frequency. Thus, all clocks can be brought to a common frequency, even 
in the presence of noise. Nevertheless, G < F in cases of practical in- 
terest. 

In a typical application, G = 10°° F, so the domain of allowed values 
of f is a very narrow interval. It may be said that the range of x is a 
neighborhood of an (NV — 1)-dimensional linear manifold. The distance 
of this manifold from the origin is determined by the initial condition. 

In cases of practical interest 7;,G' < 1, so that 


pi(O) — pi(—tit) S tif + p30) — pi). 
In such cases, (23) has the approximation 
ti Sey (0) + Di la — 4; + ps5) — pi O)]. (25) 


From this it is clear that 7;; = x;:(0) if @; — 4; equals p;(0) — p;(0). 
Therefore, an asymptotic state vector x which is, in some sense, small is 
attainable when the initial state vector x(0) is small in the same sense. 


VIII. REDUCED SYSTEM EQUATIONS 


The trajectory of the buffer memory state vector is of central im- 
portance to this work. However, the system controls operate directly 
upon the clock frequencies. For this reason, it will be convenient to shift 
attention from the M equations (20) to an N-dimensional vector equa- 
tion for the frequencies. This equation, and its component equations, 
will be called the “reduced” system equations because N < M. 

I shall proceed under the assumptions of no frequency noise and con- 
stant delays; 


n(t) = 0 (26) 
a(t) = (0). (27) 

Then, (20) can be integrated to the form 
ty(t) = Dj [pit — tH) — pi] + Bi, (28) 
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where, as before, B;; depends upon the initial condition and is defined 
by (24). 
Now a simple change of variables is made. 


qi(t) = pit) — ft. (29) 
In vector notation, this is, 
a(t) = p(t) — fly, (30) 


where ly is an N-dimensional column vector with unit coordinates. 

If the system is asymptotically stable at the frequency f, then q’ (¢) 
will go asymptotically to zero. Therefore, given any controls, it will be of 
interest to see whether this condition is realized for any value of f in 
the allowed domain. 

Substitution of (29) in (28) gives 


tyi(t) = Dy lat =n) = q; (t) =— af Be (31) 
In view of (2), (8), (26), and (30), 
q’(t) = (FF —f)ly + E+ ¢(), (32) 


where g(t) is the increment to the clock frequencies under system con- 
trol. In particular, let 


g(t) = V{tx(-)}. (33) 


Each component of © must have a realizable dependence upon the buffer 
memory state vector trajectory. The problem of control synthesis is 
precisely the problem of finding a suitable form for Pr. 

The reduced system equations are the differential-functional equa- 
tions, 


q(t) = (F —f)ly +E +rf{exlq(-), f, Bl, (34) 


with x defined by (81). The parameter, B, which depends upon the 
initialization of the system, enters (84) as a parameter in the controls. 
It is, therefore, not surprising that the system’s trajectory and its final 
state, if any, depend upon its initialization. B is not entirely arbitrary. 
This can be seen by applying the condition for phase agreement, (12) 
at t = 0. Some manipulation of (16), (6), (12), (19), and (24) shows that 


By = Dy Kx — 1, (35) 
where K ;; is an integer, and 


K3 = 940) + pO) = ps7): 
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Given the initial phases, the initial condition can be changed only by 
integral changes in the numbers of cycles y(0) stored in the buffer mem- 
ories. 


IX. CONTROLS: QUALITATIVE DISCUSSION 


Loosely speaking, it is desired to control the system so as to keep the 
buffer memory state vector small, in some appropriate sense. More 
specifically, the vector should be kept away from the faces of the unit 
cube. 

These qualitative considerations will be made more concrete by de- 
fining a class of real valued functions r(x) of the buffer memory state 
vector, which will be called ‘penalty functions”. Each such function 
will have the following properties: 


(4) r(O) = 0. 

(ii) r(—x) = r(x). 

(zit) r(x) has a continuous gradient Vr (x). 

(wv) r(x) is strictly convex; that is, for any two distinct vectors 
X1, X, and any real number A in the open interval (0, 1), 


r[\X, + (1 — A)xe] < Ar (x1) + (1 — A)r (ee). 
(v) For any x such that 


= ] lim ——~ = 
i : ee r (sx) 

These functions will have a unique minimum at the origin and will go 
to infinity uniformly on all rays from the origin. When properly chosen, 
their convex surfaces of constant value may closely approximate the 
cube surfaces having equal values of max | 2;;|. This latter quantity, 

R 


however, does not have a continuous gradient. 

The attainable equilibrium points have been shown to lie in a neigh- 
borhood of an (N — 1)-dimensional linear manifold. The infimum of the 
values of r(x) over this set is realized at a unique point in its closure. 
This point is either the origin or else the point of tangency with a surface 
of constant r. After selecting a suitable penalty function, controls will 
be sought which bring the buffer state vector near this point. 

In attempting to reach this objective, a subclass of penalty functions 
having the simple form 


r(x) = do u(a;) (36) 


R 
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will be employed. The function u(-) must have the properties, (7) 
through (v), of a penalty function on a one-dimensional real space. A 
simple example of such a function is 


n 


Un(z) = 2°", n=1,2,--- 


Fig. 3 illustrates the surfaces, 


To = max (tw~,%1) = 1 
1 = 19. a to = 1 
n= 19" a ox" = 

n= ie” os tor =]. 


The last three are penalty functions of the type defined in (36), for a 
system having just two links. 


xX. SYSTEM WITH ZERO DELAYS 


The family of systems under consideration will have widely varying 
nonnegative delays for the transmission links. In many cases of interest, 
the product of maximum loop delay and control bandwidth may be very 
small compared to unity. In such cases, the extrapolation to zero delays 





Fig. 3— Curves of unit value. 
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may be a useful and illuminating exercise. One such system will be 
treated here as a step toward the synthesis of a family of controls. 

In the zero delay case, let the controls F be simply a function of x. 
This results in a system of ordinary, autonomous differential equations. 


q’(t) = (F —f)ly + E+T (x) (37) 
in which x is now defined by 
eu(t) = Djs 'lat) — a;(t)] + Bu. (38) 


The synthesis of (x) will be based upon a penalty function which 
takes the form given in (36). One of the desired properties of the system 
(37) is that it come to rest near the attainable minimum of r(x). It is 
therefore reasonable to try controls which have a component along the 
negative gradient of r(x). 

Let V,r (x) be the column vector whose 7th component is 


[var(x)]: = in r(x) (39) 


and let A be any N X N positive definite matrix. The controls to be 
considered here are of the form 


V(x) = —AV,7r(x). (40) 
Thus, we are assured that” 
[— Var (x)]*- r(x) = 0 (41) 


with equality if and only if V,r(x) = 0. 
Make the linear change of variables, 


w(t) = A q(t). (42) 
Then, the system equations are 


w(t) = A[(F — f)ly + E] — Vor(x). (43) 


The inverse of a positive definite matrix is also positive definite. 
Therefore, 


ly’A'ly > 0 
and it is possible to choose f such that 
ly’ ALF — f)ly + E] = 0. (44) 


* The superscript 7’ will indicate the transpose of a vector or matrix. 
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It is easy to verify that 
ly’ Var (x) = 0 (45) 


in the zero delay system. Having chosen f to satisfy (44), we find that 
the trajectory of w must lie in a linear manifold which is orthogonal to 
ly. It will now be shown that the system exhibits global asymptotic 
stability when w is restricted to this linear manifold. That is, from any 
starting point in the manifold, the system will ultimately come to rest 
at a unique point in the manifold. 

Consider the following function: 


L(w) = —w’A*A“[(F — f)ly + E] + r(x). (46) 


Because of some of the properties of r(x) and the connectedness of the 
systems under study, it can be shown that LD (w) has a unique minimum 
in every linear manifold orthogonal to ly. Also, V,.(w) is zero only at 
this minimum. The proofs are given in Appendix B. 

The time rate of change of L(w) is 


L(w) = [w (t)]’V.L(w). 


But 
V.L(w) = —A7A“[(F — f)ly + E] + A7Vzr (x) (47) 
= —A'w'(t). 
Therefore, 
L(w) = —[w'(t)]"A7w' (t) = —[w' (d)]"Aw' (t). (48 ) 
In view of the hypothesis that A is positive definite, 
L(w) $0, 


with equality if and only if w’(¢) = 0. When this occurs, 
—A'’w’' (t) = V,L(w) = 0 


and the system is at the minimum of L(w). Thus, L.(w) is a Liapunov 
function for the system”'* and the system is globally asymptotically 
stable in the restricted sense mentioned above. 

Inasmuch as 


q(t) = Aw’ (t), 


w (t) = 0 implies q(t) = 0 and the system of (37) and (40) is also 
globally asymptotically stable in the linear manifold of its motion. 
It is apparent from the definition (30) of q(¢) that the system has 
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the common frequency f when q’ (¢) = 0. The Liapunov function 
L(w) = L(@q) = —q’A“[(f — f)ly + E] + 7x @)] 
has its minimum at the equilibrium point, Qmin , where 


Vib (qnin) = —ATL(F — f+ E] + Vorlk Qmin)] = 0. 


The equilibrium point is seen to be offset from that point at which 
Var = 0 when the mistuning E of the clocks does not vanish. This should 
not be surprising, because the controls must compensate for the fre- 
quency differences. 

Suppose the (V — 1)-dimensional subspace orthogonal to ly is in- 
variant under A~’. Then the requirement (44) reduces to f = F, and 
the system frequency is the average of the clock center frequencies. It 
can be shown that when the matrix A has the above property, its column 
sums are all equal. In particular, if a diagonal matrix has this property, 
it is the identity matrix, multiplied by a positive scalar. 

In this section, I have considered global asymptotic stability, rather 
than trajectories within the unit cube. Attainment of a suitably bounded 
trajectory will depend upon the A matrix and the initialization of the 
system. 


XI. A FAMILY OF REALIZABLE ORGANIC SYSTEMS 
In the last section, the controls 


Ta) = — AVor(x) = — AV, Dulas) 
R 
were shown to stabilize the system of (37) with zero delays. A family of 
controls will now be synthesized so as to be realizable and practical for 
systems having positive delay. 
Equations (38) show that 


A w(ajs) = Dis "(Bin — On)u (crs) (49) 
Qk 

using the Kronecker 6 notation. Thus, when the matrix A is diagonal, 
the controls for the clock at center 7 depend only upon the buffer memory 
states in links terminating at center 7 or originating at center 7. This is a 
very desirable simplification, and there seems to be no merit in employ- 
ing more complicated forms. A more general type of control having this 
property is 


T(x) = Dd yj (Xn;) Ts d byw’ (xix), (50) 
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where the signs have been chosen to agree with the earlier model when 
xj, jx are positive. The development is quite heuristic at this point, 
because a proof of stability for the system having the controls (50) with 
arbitrary positive coefficients is lacking. 
The final model is based upon a modified form of the controls (50). 
(t) To achieve realizability, a delay A,; must be imposed on the 
argument of a control signal from the (j,4)th buffer memory to 
center k. 
(72) The controlled frequency deviations must be limited. For this 
purpose, I introduce the limiter function p(-) such that 


p(w) =a, |[x|[ Sl 
= 1, a ae | 
= —I, A al 


(zit) For the purpose of reducing system bandwidth, a filter with 
impulse h(¢#) and unit de response may be employed. Let + 
indicate convolution. 


T',{t,x( -)} a Gp (Mt) \, a,;u (x;;(t)] 


(51) 
aa pe bu [x(t — A:s)It). 
The complete system equations are 
q’() = (F — fly + E + ritx(-)} + a (52) 
tyt) = Dy {qdé — re@)] — 9) — ref} + Bu (63) 
By = x0) + Dj {p5) — pl—752(0)}. (54) 


Equations (53) and (54) have been obtained by integration of (20). 

The definition of the delay A;; in (51) will depend upon the manner 
of transmission of the control signal. When the state x,; is transmitted 
to center @ via link (7,7), 


Ais = Aw) = dist) + reilt — dij). (55) 


This form is particularly awkward because the buffer memory delay, 
d;;(t), is not one of the canonical variables. However, it can be very 
closely approximated as follows: 


dij (t) & Dislais) + LP. (56) 
A simpler but cruder approximation is 


dij(t) = DF. (57) 
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In the special case of (51), for which b;; = 0, this complication does 
not arise. 
XII. COMPARISON WITH THE SYSTEMS OF BENES 


V. ii. Benes? has analyzed a class of linear systems having delays and 
filters. In my notation, these systems obey the equations 


pi(t) =F +E;+Gh(t)* > Gijlpj(t — 73) 
me (58) 
— pi(t)] + a(t). 


He considers the delays to be fixed, the systems to be connected systems, 
and adds the constraints 


Gis > 0 for (2,7) € R; 
> Gi; = 1. 
Ry 
Then, assuming the noise 7 (t) to be bounded and to go asymptotically 


to zero, he finds a sufficient condition for global asymptotic stability. 
This asymptotic stability is defined by 


lim pi (t) “= Ts t= fi, Z; eae N 
to 


Bene§g sufficient condition is that 
G>0 


GH (iw) 


wel |<1 for allw ¥ 0. 





H(s) is the Laplace transform of the filters’ impulse response, h (Zt); 
w is real radian frequency, 7 is the imaginary unit, and it is assumed that 
H(0) = 1, as before. 

This condition is stricter than that needed to stabilize an ordinary 
phase controlled oscillator, but it is not too difficult to satisfy. It is also 
quite remarkable in its independence of the system graph and its delays. 

Bene also gives formulas for the final system frequency and phase 
differences. These have been rederived more simply by Goldstein,* using 
the final value theorem. 

A very direct approach to the final values is to insert them in (58), 
replacing the convolution with Gh (é) by multiplication with G. 


1724 THE BELL SYSTEM TECHNICAL JOURNAL, DECEMBER 1966 


Let 
lim p; (t) =f 
to 
lim [p.() — pw()] = Bi 
and note that 
lim pt 1g) pe) ao reg. 
Making the appropriate changes in (58), 
f=Ft+E;,+ GD, auslb — pi — Tif). 
Because jy = 0 by definition, we now have a set of N linear equations 
in the N variables f, #1, ---, fy_1. Using the notation, 
rs = Do disras, 
Rj 


for an average of the delays in the links to the 7th center, the equations 
assume the simpler form, 


(1 + Grif + Gp: — G0 Gixp; = F + E,. (59) 
Ry 
They have been shown by Goldstein to give the following solution for f 
F+ >> bE; 
= —_—___. 60 
J 1+G y bit: 00) 
Here b;, 7 = 1, 2, ---, N, depends only upon the averaging coefficients, 
Gi; , (2,7) € R, and 
b; 20 
Db; = I; 


A glance at (60) shows that the final system frequency is monotone 
decreasing with the product of the de gain, G, and an average of all 
delays in the system. This effect has caused some dismay, but it results 
from an unrealistic model. 

Let us go back to the family of organic systems defined by (51) 
through (54) and (29). These will be specialized in such a way as to 
obtain a class of linear systems analogous to that of Benes. The follow- 
ing steps must be taken. 

(¢) Eliminate the limiter, p(-), from the controls, (51). 
(ii) Let w(xij) = d0i;, so w (wij) = wij. 
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(12) Use the phase variables, 
pit) = q(t) + ft. 
(iv) Set b;; = 0, for all (j,z), in (51). 
(v) Make the identification 
Gj = ai;Dij* 
and impose the constraint, 


YS &; = 1 
Ri 


(vz) Assume the delays to be constant. 
With these changes, 


pi(t) = F + E; + Ghit)« a Gijlpj(t — ri3) — p:(t)] 


(61) 
+ G dX &:1Di:;(0) + p(0) — pi(— riz)] + a(t). 


This set of system equations differs from those of Benes, (58), only in 
the addition of a term which is constant in time, but which depends 
upon the initial condition. It may be considered to be a modification of 
the mistuning, /; , in the treatment of the stability problem. Therefore, 
the proof given by Benes of global asymptotic stability under his suffi- 
cient condition also applies to (61). 

Now let us derive the equations for final values. In doing this, note 
that the jth oscillator has the natural frequency, F + E;, fort < 0, 
while its frequency has the final value f. Therefore, 


pi(—tis) = ps0) — 7i5(F + Ej) 
lim Ips(t — ris) — pit)] = — tisf. 
Now, proceeding as before, (61) leads to 
fas Frist GQ dis (Bs; =p Fa) 
+ G2 GlDst:(0) + pi(0) — pO) + re5-(F + B,)I 


Putting this in a form analogous to (59), 
(1+ Gr)f + Gp; — GD) G6; = (1+ Gr)F + E; 
Ri 


(62) 
+G Dd, Gijlra;H; + Dijvis(0) + pi(O0) — p;(0)). 
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The solution for f, analogous to (60), is 


¢>> b,4:;D5;2:;(0) (63) 
= ff + ejb; + —~—_ ——_ 63 
x nae 1+ G45 bir: 
where the coefficients c are averaging coefficients defined by 
De (byije + Gbidkzsrsy) (64) 
R 
and the coefficients b are the same as before. 

The first two terms on the right of (63) give an average of the in- 
dividual clock frequencies. The last term depends on the initial condi- 
tion, but it goes to zero as the system delays become large. Thus, the 
behavior indicated by (60) does not really occur in our model for organic 
systems. 


XII. SOME REMARKS ON THE STABILITY PROBLEM 


The mathematical problem of stability is not yet satisfactorily solved 
for general organic systems. Two special families of organic systems are 
now known to be globally asymptotically stable. These are certain non- 
linear systems with zero delays and certain linear systems with delays 
and filters. These very special cases nourish the hope that broader 
sufficient conditions for stability can be found. 

The present section will be devoted to redefinition of the stability 
problem and a discussion of some necessary conditions for stability. 

We have seen that the system will malfunction whenever the M/-di- 
mensional buffer state vector x(t) leaves the unit cube. This leads to the 
following practical definition of stability. 


Definition: For any positive ¢, trajectory x(t) is e-stable if 


max | 2,(t)| S$ € for OSt<o. 


Definition: A trajectory is stable if there is an ¢ < 1 for which it is e- 
stable. 

The trajectory of an undisturbed organic system will depend upon the 
system parameters and the initial condition. Therefore, the domain of 
system stability must be defined in a space having the following coordi- 
nates, which appear in (51) through (54). G, h(-), a, b, u(-), A, D, 
x(0), E, « and p(¢) for ¢ S 0. The initialization of the filter states must 
also be defined. . 
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Several necessary conditions should be kept in mind when testing for 
stability. First, the asymptotic frequency of the system must lie within 
the controllable range, 


Second, the limit points of the buffer state trajectory must lie within 
the unit cube. Third, the system must be ‘‘connected,” in some sense. 
The connectedness of the systems deserves further discussion. I have 
required that the organic systems must be connected in the following 
sense: that there must exist a directed transmission path from each 
center to every other center. Benes has used the same condition. On the 
other hand, the stability of the special nonlinear systems which I have 
treated depends upon a weaker condition. Namely, that the nonoriented 
graph having a branch corresponding to each link must be connected. 
This condition is used in Appendix B, which is essential to the proof. 

An important difference between the systems treated here and those 
of Bene, is that the former have the additional control coefficients, 
b;; 2 O for (7.7) ¢ R. Thus, the state x,;(¢) of a particular buffer may 
exert a control over the frequency of the sending center, j, as well as 
over the receiving center, 7. 

Intuition suggests a necessary condition for stability based upon the 
control coefficients, a;;, b:;, which appear in the family of equations, 
(51). Inasmuch as negative coefficients tend to make the systems un- 
stable, these are assumed to be either positive or zero. 

Consider a ‘‘control graph” with nodes numbered 1, 2, ---, N. Let a 
directed branch exist from node 7 to node 7 if and only if a;; + b;; > 0. 
This condition permits the frequency at center 7 to be influenced by its 
phase relative to that at center 7. Then a necessary condition for system 
stability is that the control graph shall have a node from which directed 
paths exist to all other nodes. 

Under this “weak” condition, some parts of the system may simply be 
‘“‘slaves” of another part of the system. The “strong” condition that 
there exist a directed path from each node to each other node precludes 
this possibility. However, it should be understood that the condition 
satisfied by the control graph need not be satisfied by the graph whose 
branches correspond to transmission links. This is the case because each 
link may give rise to two oppositely directed branches of the control 
graph. On the other hand, a connected system may lack stability when 
too few of the control coefficients are positive. 

A simple example is provided by the system shown in Fig. 4. The 
digital transmission links appear in the “‘system graph” Fig. 4 (a). When 
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(a) (b) 
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Fig. 4— A system graph and three control graphs. (a) system graph; (b) in- 
adequately connected control graph; (c) weakly connected control graph; (d) 
strongly connected control graph. 


the only positive coefficients are da , d23, the control graph of Fig. 4(b) 
results. This fails to satisfy even the weak condition. It is clear that no 
means exists for bringing centers 1 and 3 to a common frequency. The 
control graph in Fig. 4(¢) results when ay, do, be3 are the positive 
coefficients. It must be assumed that a separate means for transmitting 
the narrow band signal, be; , to center 3 exists. This control graph satis- 
fies the weak condition. If the system is stable, center 3 will be a slave 
to the rest of the system. It will have no influence upon the frequency 
trajectories of centers 1 and 2. If aie, @e1, d23, bes are the positive co- 
efficients, then the control graph of Fig. 4(d) results. This one obeys the 
strong condition. When the control graph is the same as the system 
graph, Fig. 4(a), it is weakly connected. In this case, if the system is 
stable, center 3 determines the common frequency. 


XIV. SUMMARY 


A class of systems for the mutual synchronization of spatially sepa- 
rated oscillators has been synthesized and a mathematical model for 
these systems has been presented. The model may be said to be physi- 
cally realizable in that real systems can be built whose function will 
very closely approximate the behavior of the model. While no such sys- 
tem hardware has been presented here, a simple hardware analog has 
been. built.° 

These systems, called “organic synchronization systems,” have a 
possible application to continental or worldwide PCM communications. 

A re-examination of the systems treated by Benes in the light of the 
newly derived organic model indicates that 

(t) his stability proof does apply to a particular class of linearized 

organic systems, and 
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(ii) that his formula for the final system frequency must be modified; 
the suitably modified formula no longer displays a monotone de- 
creasing final frequency with increasing system delays. 


APPENDIX A 


Asymptotic Phase Differences 


The set of all differences of the form, (¢; — g;), can have at most 
(N — 1) linearly independent elements. To verify this, consider the 
(N — 1) elements 


(Qn — Gv-1), (Qw-a — Gv-2), °°) (he — hh). 
Suppose 7 > j. Then 
(C=O) SG = 6) > Ga qi-2) Ses ae (25 ae 
while 
(@; — &) = —@ — 4). 
Hence, any other element of the set of differences can be represented as 
a linear combination of the selected (NV — 1) elements. 


When the directed graph, which corresponds to the synchronizing net- 
work, is connected the set of differences 


{@ — 43) | Gt) « R} 


has at least (V — 1) linearly independent elements. Actually, only the 
weak, i.e., nonoriented, sense of connectedness of necessary for the proof. 


Theorem: Let G be a directed graph with N vertices such that the correspond- 
ing nonoriented graph is connected. Associate the N independent real 
variables, G1, Ge, °**, Gv, one to one with the correspondingly indexed 
vertices. Associate the difference (G; — G;) with the edge from vertex 7 to 
vertex j, for each edge in G. Let T be any complete tree of G. 

Then, there is a set of (N — 1) linearly independent differences asso- 
ciated with edges of T. 


Proof: We shall proceed by induction. The theorem clearly is true when 
N = 2. 

Suppose the theorem to be true for N = LZ — 1. Now consider G to 
have ZL vertices. Since it is connected, it contains a complete tree 7, 
which will have Z — 1 edges, but all Z vertices. It follows that not all 
vertices can have more than one edge of 7 incident on them. Let vertex 
j be an end vertex. Then, only one of the differences associated with the 
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edges of 7 contains g; . It follows that this difference, say +(@; — @;), 
is linearly independent of the remaining (1 — 2) differences. But if 
vertex 7 and the edge incident on it are removed from 7, a tree having 
(L — 1) vertices remains. By hypothesis, this contains (L — 2) edges 
whose associated differences are linearly independent. Hence, there are 
(L — 1) linearly independent differences associated with the edges of 
T, and the proof is complete. 


Corollary: The differences are unchanged when the average of the qs is 
subtracted from each of them. Therefore, the theorem applies even when 
the N real variables are constrained to have zero sum. 


APPENDIX B 


Properties of L(w) 


Certain preliminaries concerning convex functions will be necessary 
here. While they are familiar to mathematicians, other readers may find 
the following review helpful. 

All sets to be considered here are subsets of finite dimensional real 
linear spaces. All functions will be defined on such sets. 


Definition 1: A sct of points X is convex if every point of the set 
{AXe + ¢! = \)Xi2X1, X.eX and 0 Z\ASs 1} 
is also a point of X. 


Definition 2: A real valued function f(-) defined on a convex set X is a 
convex function on X if 


f[\X2 + (il —A)m] S Af (Xs) Gis \)f (&1) (65) 


whenever X1, X.¢« X and0 S$) <1. 
A convex function f(-) is strictly convex if the equality in (65) im- 
plies that’ = OorA = lorx = xX. 


Theorem 1: If two convex functions are defined on the same convex set, 
their sum ts a convex function on that set. If one of the functions ts strictly 
convex, then the sum is strictly convex. 


Theorem 2: If f(-) is a convex function on a convex set X, and if a = 0, 
then of (-) is a convex function on X. If f(-) ts strictly convex, and if 
a > 0, then of (-) ts strictly convex. 


Theorem 8: If ¢ is a fixed vector in an n-dimensional space, and x ts a 
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variable vector in the same space, and d is a real number, then (c”-x + d) 
ts a convex function on the entire space. 


Theorem 4: Let f(-) be a convex function on an n-dimensional space. 
Then the sets of points 


ixf(x) Sd, [xf(x) <d 


are convex subsets of the space for every real number, d. 
The above theorems are elementary consequences of the definitions, 
1 and 2. 


Theorem 5: Let f(-) be a differentiable convex function defined on a convex 
set X. Let X1, Xo be distinct points of X. Then the directional derivative, 
(d/dd) f[\x2 + (1 — X)xi], 2s an increasing function of d in (0,1). 


Proof: Select any \y > 0, 6 > O such that AX» + 6 < 1. Let 
Xo = AoXe + (1 — Ao)M, y=X%— XxX. 
Then 
(Xo + 6)X2 + (1 — Ao — 8)X1 = Xo + Sy. 
Now select an ¢ such that ¢ > 0 and A» + 6 + € < 1 and apply (65) 


twice as follows: 


6 
jax as ‘it Cone ee x] 


| 


=, —— f(xXo + dy + ey) + 5 S(m) 


f(x tey) Ss pelle + dy + ey) ae wert he 


Adding these inequalities and rearranging terms, 
f %o + dy + ey) — f(Ko + by) 2 f(Ko + ey) — fo). 


Dividing both members by ¢ and taking the limits as e — 0 yields the 
desired result, 


aries + (1 — d)xil 


d 
ay fax 7 ae d)xi] | = 2 ee 


Corollary: If f(-) ts strictly convex on the convex set X, and tf X1, Xe 
are distinct points of X, then (d/dd) f[Ax2 + Cl — A)mx] 2s a strictly 
increasing function of \ in (0,1). 
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Proof: Suppose that the equality holds in Theorem 5. If the directional 
derivative is nondecreasing in [\Ay, (Ao + 6)] and takes equal values at 
the end points, then it must be constant on this interval. It follows that 
f(-) varies linearly on the line segment from 


X = \oXe + (1 — )o)X1 to x= (Xo + 6)X2 + el — do — 6)x. 


This contradicts the hypothesis that f(-) is strictly convex on X, and 
the proof is complete. 


Theorem 6: Let f(-) be a strictly convex differentiable function defined on a 
convex set D. Let C be a closed and bounded convex set in the interior of D. 
Then, 
(t) f(-) assumes tts minimum value over C at a unique point of C, 
(i7) f(-) has a vanishing gradient at no more than one point of C, and 
(zit) f(-) assumes its minimum value over C at an interior point of C 
af and only if the gradient of f (-) vanishes at that point. 


Proof: 

(¢) The hypotheses imply that f(-) is a continuous function and 
that C is a compact set. It follows that f(-) assumes its minimum 
value over C, fmin, at some point of C. Now suppose that 
X,, Xp are distinct points of C' such that 


f (x) = f (x) = Sita . 
Then the strict convexity of f(-) implies that 


Xi Xe 
(2 si =) < Feito 


This is a contradiction of the hypothesis that fin is the least 
value of f(-) over C. 

(it) Suppose the gradient of f vanishes at two distinct points of 
C, x, and x,. Consider the directional derivative of f(-) along 
(x, — x1). By hypothesis, this derivative vanishes at x,. Be- 
cause f(-) is strictly convex, it is a strictly increasing function 
of position along (x, — x1). Therefore, it is greater than zero 
at X2, which contradicts the hypothesis that the gradient 
vanishes at X2. 

(it) Suppose f (Xo) = fmin and Xo is an interior point of C’. All points 
in a neighborhood of xX are in C. If the gradient of f(-) does not 
vanish at X), then there are points in this neighborhood, along 
the negative gradient from xo, at which f(-) assumes smaller 
values. This contradicts the hypothesis that fmin is the least 
value of f(-) in C. 
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Now suppose that the gradient of f(-) vanishes at x9, an in- 
terior point of C. Let x; be any other point of C and consider 
the directional derivative of f(-) along (x1 — x»). This vanishes 
at Xo by hypothesis. The strict convexity of f(-) implies that it 
is strictly increasing from xX» to x, . Therefore, f(xi) > f(xo) and 
f(-) has its minimum over C' at Xo. 

This completes the proof. 

In considering the properties of L(w), an additional result will be 
needed concerning the penalty function r(-). Its properties are num- 
bered (z) through (v). Note that the convergence in (v) was not as- 
sumed to be uniform over the unit sphere. This property will now be 
deduced. 

Because r(-) is a strictly convex function on the M-dimensional 
vector space, it is also strictly convex on any subspace. In particular, 
r(sX1) is a strictly convex function of s for any x: on the unit sphere. 

Then, for any real number P, however large, 


e(s,X1) = r(sxi1) — sP 


is a strictly convex function of s. Let #, be the set of vectors, x, on the 
unit sphere for which 


e(s,x) S 0. 


For any fixed s, £, is a closed subset of the unit sphere because e (s,x) 
is continuous. It follows that H, is a compact set. 

The corollary to Theorem 5 tells us that along any ray from the ori- 
gin, 1.e., for s going from zero to infinity, the derivative of e(s,x1) is 
strictly monotone increasing. From this it can be seen that x ¢ £;, 
implies that x, ¢ EH, for s > so. Therefore, the sets H, decrease as s 
increases from zero to infinity. 

The intersection of a class of compact, decreasing, nonempty sets is 
nonempty. Therefore, if 


it is clear that there exists an so such that H, = 0 for s = 8. In this 
case, 


r(sx) > sP forall s = so(P) 


independent of x on the unit sphere. 
On the other hand, if 


N E, ¥ 0, 
s=0 
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then there exists a unit vector x; , such that 
r(sxi) S sP forall sin (0,~). 


This contradicts hypothesis (v) concerning the penalty functions. 
Thus, we have seen that 7 (sx)/s goes to infinity with s uniformly for 
all x on the unit sphere. 
Now consider the function E(w), defined by (46), with w restricted 
to an (N — 1)-dimensional linear manifold orthogonal to ly . Let 


w=veo+aly 


where a is a real number and v is restricted to the (NV — 1)-dimensional 
linear subspace orthogonal to ly . 
In view of (38) and (42), L(w) can be put in the form, 


L(w) = Liv) =e -v+d+r(KAv +b). (66) 


Here, c is a fixed N-dimensional vector, d is a scalar constant, and b 
is a fixed M-dimensional vector, with M = N. The fixed N X N matrix 
A is positive definite. It can be seen from the discussion in Appendix A 
and (38) that the fixed M X N matrix K is of rank (VN — 1) for con- 
nected systems. Its null space is spanned by ly . 

Using the convexity of r(-), 


r(ZKAv) S ar(—b) + ar(KAv + b) 
r(KAv + b) = 2r(4KAv) — r(—D). 
Using this in (66), 
L(w) = L(v) = 2r (4K Av) + ¢’-v +d —r(—b). 


The right-hand member is dominated by its first term as | v | becomes 
large, uniformly over the subspace orthogonal to ly. Therefore, we 
can find a sphere of sufficiently large radius so that 


Liv) > L(O) 


for all v on its surface. Then the minimum value of L(v) over this 
sphere is not assumed on the boundary. 

Now it will be shown that L(v) is a strictly convex function of v 
on the subspace orthogonal to ly . Let 


xX, = KAv, + b 
X, = KAv>2 + b. 
The properties of K and A are such that v,; ¥ ve implies x; ¥ x.. This 
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permits us to apply the strict convexity condition, 

riK Adve + KA (1 — dX)vi + Db] 

r[\X2 + (1 — A)xi] < Ar(K.) + (1 — A)r (x) 
Ar (KAve + b) + (1 — \)r(KAvi + b). 


This is sufficient to establish the strict convexity of L(v). 

Inasmuch as the strictly convex function L(v) takes on its minimum 
value over every large sphere at an interior point, its gradient vanishes 
uniquely at that point. 

The above statement applies to the restriction of L(v) to an (N — 1)- 
dimensional space. However, we know that w (¢) vanishes along ly. 
Equation (47) then shows that 


V,l(w) = —A‘w' (t) 


I 


vanishes along a direction which is not orthogonal to ly . It follows that 
the unrestricted gradient of [(w) vanishes at a unique point of every 
linear manifold orthogonal to ly . 
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Systems of Mutually Synchronized 
Oscillators 
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The synchronization of large systems of geographically separated oscilla- 
tors ts of considerable practical interest for pulse code modulation (PCM) 
switching. This study examines the factors that determine the frequency at 
which such a system operates, considering both the procedure by which tt is 
set up and the topology of system interconnections. A necessary and suffi- 
cient connectivity condition is established. 


I. INTRODUCTION 


The synchronization of large systems of geographically separated 
oscillators is of considerable practical interest for pulse code modulation 
(PCM) switching. Synchronization could be achieved by establishing a 
single master oscillator, with every other cscillator slaved either di- 
rectly to the master or to another oscillator that is slaved directly or 
indirectly to the master. However, the system would then be vulnerable 
to failure of a single link or a single oscillator. An alternative called 
“mutual synchronization”? would permit the oscillators to determine the 
system frequency jointly and to exchange synchronization information 
over redundant paths. However, the complexity of the system raiscs 
questions concerning the factors that determine the system frequency 
as well as system stability and dynamic response. 

A broad sufficient condition for the stability of mutually synchronized 
systems was first established by Benes.! This condition has recently been 
rederived by a different method, for a slightly more general system, by 
Gersho and Karafin.? The model used in both these studies was over- 
simplified so that it gave a paradoxical result for the system frequency 
at equilibrium. A model that corrected this oversimplification, by 
considering the received signal phases observed at each oscillator station 
at the initial moment when all oscillator controls are put into operation, 
was first devised by Runyon.’ A corrected model based on the same 
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principle, but differing in detail, has recently been independently derived 
by Karnaugh.+ In all these studies, it was assumed that the system was so 
interconnected that every oscillator transmitted timing information 
either directly or indirectly to every other oscillator. This condition has 
been generally assumed to be necessary for mutual synchronization, and 
has been proved sufficient by Gersho and Karafin.? 

This paper will generalize the foregoing results in two ways that appear 
to be significant for practical applications. In the first place, a mathe- 
matical model will be described that allows the synchronized system to 
be set up by less drastic methods than the simultaneous closure of all 
control paths at t = 0. In the second place, a weaker connectivity condi- 
tion, which is satisfied by systems in which only some of the oscillators 
participate in frequency determination, will be proved necessary and 
sufficient for synchronization. 

The practical consequence of these generalizations is that a system 
with a single master oscillator can be regarded as a special case within 
the general class of mutually synchronized systems, and a locked oscilla- 
tor synchronized to a remote source can be regarded as a special case of 
an oscillator station in a mutual synchronization system. Between the 
extremes of a system with no slaves and a system in which all stations 
but one are slaves, a variety of hierarchical organizations may be en- 
visioned. However, the description of particular configurations is beyond 
the scope of this article. The model developed here also provides the 
flexibility by which new stations can be added to an existing system, and 
the system frequency can be adjusted after synchronism has been es- 
tablished. 


II. THE MATHEMATICAL MODEL 


The system is assumed to consist of N oscillators, or “‘clocks,”’? num- 
bered 7 = 1, --- , N. Each oscillator has its own free-running frequency 
f;, at which it would operate in the absence of a control input. Each 
oscillator accepts a control input that causes its frequency to deviate from 
the free-running frequency by an amount proportional to the control 
input. For concreteness, the control input will be referred to as a voltage, 
although it may, in practice, take other forms. Thus, the instantaneous 
frequency of the 7th oscillator, which will be expressed simply as the rate 
of change of phase p,’(¢), will, in general, be different from the free-run- 
ning frequency f; . 

In applications to switched PCM networks, each oscillator controls 
the timing of a digital signal which is assumed to be organized with a 
fixed number of pulses per frame. It will be convenient to measure phase 
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in frames of the digital signal, and frequency in frames per second. Each 
station sends a digital signal, controlled by its local oscillator, to a num- 
ber of other stations, and this signal conveys timing information. To 
simplify the description, it will be assumed that all these signals are sent 
in the same phase, and this will be taken to define the phase of the local 
clock. However, the model could easily be adapted to the case in which 
each signal is sent in some arbitrary but fixed phase with respect to the 
local clock. 

The transmission delay from the jth station to the 7th will be desig- 
nated as 7;;. Thus, the phase of the signal received at the 7th station 
from the jth is p;(¢ — 7;;). The phase is defined principally by a regular 
pattern of framing pulses. The pulses between the framing pulses carry 
information, and are therefore different in successive frames. Since 
successive frames are distinguishable, the cyclic ambiguity inherent in 
the measurement of the phase of sinusoidal signals is not inherent in 
the digital case. 

Thus it is possible to measure, at the 7th station, the phase difference 
pi(t — ti;) — pi(t) between the received signal from the jth station and 
the local clock. This phase difference will be called the ‘‘observed phase”’ 
of the jth signal at the 7th station. In the Benes! model, used also by 
Gersho and Karafin,? the control voltage at the oscillator consists only 
of components proportional to the observed phases. However, as Gersho 
and Karafin? pointed out, if all the clocks are in phase all the observed 
phases will be negative, and every clock will be made to run slower than 
its free-running frequency. In the present model, a fixed reference phase 
r;j will be subtracted from each observed phase, this reference phase 
preferably being equal to the phase difference one would expect to ob- 
serve. If the observed phase of each signal is equal to the reference 
phase, no control voltage is applied to the oscillator, which then runs at 
its free-running frequency. 

Historically,’ the concept of mutual synchronization evolved in terms 
of phase averaging. Thus, the observed phases of the received signals 
were respectively multiplied by nonnegative averaging coefficients a,; , 


N 
Dy aj = i (1) 
= 


to form an average phase difference between the local clock and the 
signals received from its neighbors. The average observed phase may 
then be multiplied by a nonnegative factor A; , having the dimensions of 
inverse time, to determine the frequency displacement of the local clock. 
This basic notation has been continued in subsequent studies and will 
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be used here. Thus, in the present model, the system equations must be 


i(t) = fit ry ulpj(t — ri) — pilt) — rig, 
D f 2, aulp p ss 


i=1-++,N, 


-The reference phases can be absorbed into the free-running frequency 
term by defining a reference frequency 


N 
vi =fi— MD ars, i=1,---,N. (3) 
j=1 
The system equations can now be written as 
- ‘ 
pi(t) = 1+ Dd, aslpj(t — riz) — pilt)], t= 1,+++,N. (4) 
= 


These equations have formally reverted to those of the Benes model,!” 
in which the reference phases do not appear. However, while the equa- 
tions are the same, their application is different, since v; is not the free- 
running frequency, but is normally greater than the free-running fre- 
quency, because the reference phases r;; are normally negative. The 
reference phases may, in fact, be identified with the initial-condition 
terms of Runyon? and Karnaugh,* so that (4) covers their models as 
well as the Benes model. 

The dynamic response of the system can be modified by using a filter 
in each control system. Multiplication by d; is then replaced by convolu- 
tion with the impulse response h,(é) of a filter whose zero-frequency gain 
is 


[ na =x, ee ees (5) 
0 


This has been done in all the referenced studies. Gersho and [Karafin? 
also added a variable term to v; , replacing it formally by v,(é), to repre- 
sent the effects of transient disturbances. The system equations there- 
fore become, in the most general form to be used here, 


pi(t) = v(t) + hy(t) * x aijlpj(t — 743) — pi(t)], (6) 


(eel wel, 
where the asterisk (*) denotes convolution. Neither of these changes 


affects the equilibrium frequency. 
It has been assumed that the filter gains and averaging coefficients are 
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all nonnegative, 


ri 2 0, (7) 


aij = 0. (8) 


The connectivity of the network depends on which of these coefficients 
are zero. If the 7th station does not receive from the jth, a;; is zero, 
except when the 7th station does not receive from any other station, in 
which case (1) forbids all a;; to be zero and ; must be zero, and the a,; 
are then arbitrary. It is understood that if the zth station in fact receives 
a digital signal from the jth, but uses it only as a medium of communica- 
tion and does not use its observed phase in controlling its clock, it will 
be said that the 7th station ‘‘does not receive from” the jth. 


III. THE INITIATION OF SYNCHRONOUS OPERATION 


Previous studies have assumed that the system is placed in synchro- 
nous operation at t = 0 by simultaneously closing all the switches at each 
station that connect the control voltages to the oscillators. It is assumed 
that before t = 0 all oscillators are operating at their free-running fre- 
quencies, and have been running for a sufficiently long time so that, in 
spite of transmission delays, all stations are receiving signals on all links 
by the time the switches are closed. Closure of each switch will, in general, 
cause an immediate change of frequency at every station, and prediction 
of the frequency at which the system finally will settle down would be a 
matter of practical importance. 

In practice it may be preferable to assemble the system in more lei- 
surely fashion — one station at a time — checking for proper operation 
after each station is connected before connecting the next one. One might, 
for example, realize the reference phases 7;; as manually controlled bias 
voltages. When a new station is to be connected into the system, the first 
connection will be made at the new station, from one of the phase detec- 
tors to the input of the clock control filter. This connection will syn- 
chronize the new station with the system as a slave station, and adjust- 
ment of the corresponding reference phase can be used to establish any 
desired phase relation between it and the rest of the system. When each 
subsequent connection is made from a phase detector to a clock control 
filter, the associated reference phase is adjusted so as to null the voltage 
across the switch at the moment when it is closed. There is then no dis- 
continuous change in frequency at any time during the connection 
process. 

If the system were built up in this way, starting from one station as the 
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initial system, and if there were no drifts in either free-running fre- 
quencies or transmission delays, the final equilibrium frequency of the 
system would be the free-running frequency of the first station. In this 
case the equilibrium frequency could be predicted without any calcula- 
tion. In any case, the system frequency can be deliberately changed after 
initiation of synchronous operation by adjusting the bias voltages. 

The equation for system frequency is still useful as a means of predict- 
ing the effects of drifts in the free-running frequencies and the transmis- 
sion delays. However, serious questions can in principle arise with regard 
to the applicability of the general system equations (6). These equations 
represent a system with invariant connectivity, represented by invariant 
averaging coefficients a,; and gains \; , while the actual system connectiv- 
ity has been a function of time. Karnaugh* and Gersho and Karafin? 
have answered these questions for their models under the particular 
initiation procedure they assumed. The answer will now be extended to 
cover the present model for arbitrary initiation procedure. 

I shall take the point of view that there is some specifiable moment fp 
at which the system has been completely assembled, so that the a;; and 
d; are invariant for ¢ = t), and that we need only to predict the future 
behavior of the system, for specified disturbances in v,(¢) and drifts in 
7:;, having full knowledge of the past behavior of the system. For the 
purpose of this discussion, if the transmission delays 7,; are to be allowed 
to change, they should be considered as having been written as 7;;(t). 

The system equations (6) are actually integrodifferential equations, 
since the convolution symbol (*) implies an integration. The initial 
conditions on which the solution of this equation depends are the entire 
history of the phase variables p,(¢), to the extent that this history deter- 
mines the state of the filters. The output of each filter for ¢ = t) can be 
considered as the sum of two components: a transient term determined 
by the state of the filter, which is in turn determined by the input for 
t < ¢, and a term representing the response to inputs for ¢ 2 t). The 
transient terms can be calculated from the known filter inputs for ¢ < to 
and included in the v;(t) terms. Equations (6), with these terms included 
in v,(t), with the filters considered quiescent at t = ft, and with the 
correct initial values of p:(to), will then give a correct description of the 
behavior of the system for t 2 to. 

This argument is included here only to establish the validity of (6) in 
principle. In practical calculations, estimates of the effects of transient 
disturbances would normally assume an equilibrium state as the initial 
condition. 
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IV. EQUILIBRIUM STATES 


Gersho and Karafin? determined the equilibrium frequency of the 
system as a limiting value derived by means of the final value theorem 
for Laplace transforms. Karnaugh‘ used a simpler method, claiming for 
it only heuristic value. The following approach claims rigorous validity 
for the simpler method. 

The first step in the analysis of the system will be the determination 
of its equilibrium states, without regard for whether they are stable or 
unstable equilibria. These can be determined by assuming that the sys- 
tem has been placed in some state, and that it will not change state 
spontaneously; any state that satisfies these conditions is an equilibrium 
state. We determine in this step whether the equilibrium state is unique. 
The second step is to determine whether the system can respond to any 
transient excitation with components that do not approach zero with 
increasing time; this step determines the stability of the system. The 
linearity of the system now implies that if the equilibrium state is unique, 
and the transient response approaches zero, the system will always 
approach the equilibrium state in the absence of a disturbance. 

It will, in fact, be found that the equilibrium state is not unique, 
because the system equations include the phases only in phase differ- 
ence terms, and an arbitrary common constant can be added to every 
phase variable without changing the phase differences. There is, there- 
fore, a continuum of equilibrium states, all of which are equivalent for 
practical purposes in that they have the same phase differences and the 
same system frequency. Because of this equivalence, the system will be 
considered stable if, after a transient disturbance, it approaches any 
equilibrium state, not necessarily the one it occupied before the dis- 
turbance. This requires only that the transient components of the phase 
differences approach zero, while the transient components of the phases 
may approach arbitrary limits. 

This section will deal only with the first step: the identification, in- 
cluding determination of conditions for existence, of equilibrium states. 
The stability of the equilibrium states can be assured by the sufficient 
condition studied by Gersho and Karafin;? their proof remains valid 
under the weaker connectivity condition shown here to be necessary 
and sufficient, the statement that at least one Mj is positive sufficing 
to replace their statement that all 17; are positive. 

The only equilibrium states to be considered here are those in which 
all clock frequencies are constant at a common value; if they are con- 
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stant, but at different values, synchronism has not been established. It 
will also be required that the existence of such a state should not de- 
pend on the values of the free-running frequencies; if it does, the system 
is not self-synchronizing, but is synchronous only if the clocks are ad- 
justed by means external to the system. 

The instantaneous frequencies p,’(é) are, therefore, set equal to an 
equilibrium frequency denoted simply by f. Then 


pit) = ft +P, @ = 1 Aes Ns (9) 


Equation (4), which suffices even in the most general linear case repre- 
sented by (6) for the description of the steady state, becomes 


N 
f= vet ds 2, ai(P; — Ps — frit), t=1,---,N, (10) 
jm 
or, in more symmetrical form, 
N 
DD A653 — ai;)P; = v: — f(1 + Avi), t= 1, ee dN (11) 
7=1 
where 
N 
i= > QijT ej (12) 
j=l 


and 6;; is the Kronecker delta, equal to unity for7 = 7 and zero otherwise. 
The set of equations (11) looks as though it could be solved for the P; in 
terms of arbitrary v; and f, but it cannot, because the matrix of coeffi- 
cients on the left is singular. This will be stated and proved asa theorem. 


Theorem I: Let L denote the diagonal matrix with diagonal elements 
Ai, let A denote the averaging matrix with elements ai; , and let I denote 
the identity matrix. Then the matrix LUI — A), with elements \.(6;; — ai), 
has rank less than its order N. 


Proof: By (1), the sum of the elements in any row is zero. Therefore, 
the sum of all columns is a column of zeros. Therefore the matrix is 
singular and its rank is less than its order, Q.E.D. 

It is advantageous at this point to choose one P; arbitrarily as a refer- 
ence for the others. With P, as reference, we change to the phase difference 
variables 


Qe fp Pig FE 2h Ne (13) 
The equations (11) then become 
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N 
Dili — @:;)Q; = v5 — fl + Aare), 1 Aes N. C14) 
rap 
If the term in f is transposed to the left side, we get the set of N equations 
N 
fCL + dari) Te Ais — a)Q3 = 4, t= 1,-+-,N, (15) 
= 


which we expect to be able to solve for the N unknowns f and Q;,7 = 
2,--+-,N, for arbitrary v;. 

If we formally solve for f by determinants, and expand each deter- 
minant in terms of the elements of the first column and their cofactors, 
the result is 


N 
yD bv: 


f= _—— ___, (16) 
2 bi(1 + ars) 


where b; is the cofactor (signed minor) of the element in the first column, 
ith row, of the matrix L(J — A). The following theorem shows that the 
arbitrary choice of P; as the reference for phase differences, and the 
definition of b; in terms of the first column, makes no difference in the 
result. 


Theorem II: Let M,,; be the cofactor of the (2, 7)th element of LI — A); 
then M;; = Mix for all 2,7, k, that 2s, all cofactors of elements in the same 
row are equal, and hence M;; = 6; for alli,7=1, ---,N. 


Proof: If the rank of L(J — A) is less than N — 1 then all 1/;; are 
zero and the theorem is satisfied. If the rank is N — 1, the matrix equa- 
tion LJ — A)x = 0, where x is an N element column matrix, has only 
one independent solution. It is known from (1) that a solution exists in 
which all components are equal, and this must now be true of any solu- 
tion. It can also be shown that the cofactors of any single row of the 
matrix L(J — A) must be a solution (see, for example, Guillemin‘), 
hence all cofactors of elements in a row must be equal, W7;,, = Mix, 
hence M;; = My = b;, Q.E.D. 

Since b; can now be defined without reference to any particular column, 
the single-index notation is justified. Since we expect that increasing the 
free-running frequency of any oscillator will never decrease the equilib- 
rium frequency f, we should expect all the b; to be nonnegative. The fol- 
lowing theorem verifies this expectation. 
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Theorem III: The cofactors of elements of the matrix L(I — A) are 
nonnegative, b; 2 0,4 = 1,--- ,N. 


Proof: If a matrix is diagonally dominated, i.e., if every diagonal 
element is greater in magnitude than the sum of the magnitudes of all 
other elements in the same row, it is easily shown (Appendix I, Gersho 
and Karafin?) that it must be nonsingular. Consider the matrix L(J — 
eA),0 S ¢ S$ 1. The cofactors of its diagonal elements are continuous 
functions of ¢. For e = 0 they are all unity, hence positive. For0 < «<1 
the cofactors are the determinants of diagonally dominated submatrices, 
hence nonzero, so that they cannot pass through zero, and must remain 
positive. Hence, as e — 1 they cannot approach negative limits. But as 
é — 1 they approach the values b;, hence b; = 0, Q.E.D. 

The formal solution (16) is valid if and only if the matrix of coefficients 
on the left side of (15) is nonsingular; that is, if and only if the denomina- 
tor of (16) is nonzero. But, since b; , \; , and 7; are all nonnegative, this is 
equivalent to the condition that at least one b; be positive. The following 
definitions and theorems relate this algebraic condition to the connectiv- 
ity properties of the network. 


Definition: The jth station is said to send to the 7th, or equivalently, 
the zth station is said to receive from the jth, if X,a;; is positive. 


Definition: The jth station is said to send directly or indirectly to the 
ath, or equivalently, the 7th station is said to receive directly or indirectly 
from the jth, if there exists a chain (ordered set) of stations such that the 
first is station 7, the second receives from j, each receives from the one 
before, and the last is station 7. 


Theorem IV: If the kth station does not transmit directly or indirectly to 
all other stations then b,; = 0. 


Proof: Let Ax, be the submatrix formed by deleting the kth row and 
column of L(J — A). Let S; be the set of indices of all stations that do 
not receive directly or indirectly from the kth. By hypothesis S, is 
nonempty ; choose 7 « S; . By the definition of S; , X,a:; is zero if 7 is not 
in S; , hence, from (1), 

ye NiQ; = ri ) Le Sz . (17) 
7S; 
Let B, be the square submatrix of Ax, consisting of all elements whose 
row and column indices are both in S; . Then (17) shows that B; can be 
written in the form L’(I — A’), where A’ is an averaging matrix satisfy- 
ing (1). Hence, by Theorem I, B, is singular, and the rows of B, are 
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linearly dependent. But since the 7th row of Az, , for all 7 e S; , is the 7th 
row of B, augmented with zeros, the same linear dependence holds among 
the rows of Ay, . Hence, Ax: is singular; hence its determinant, which is 
b; , is zero, Q.E.D. 


Theorem V: If by cs zero, then the kth station does not transmit directly 
or indirectly to every other station. 


Proof: Let Ax, be defined as in the proof of Theorem IV. By hypothe- 
sis, Are 1S singular; hence there exists a column matrix x, with elements 
x; not all zero, such that Ay. = O, or equivalently 


» NiGejX; = Ny ) t x k. (18) 
i#k 
Let MW be the magnitude of the x; having the largest magnitude. Let S, 
be defined now as the set of all indices 7 for which | x; | = M; obviously 
S, is nonempty. Now (18) implies 
D> Raises] => U.M, 1€ Si s (19) 
1k 





Now (1) and |2;| < M imply that this can be true only if \,a2, = 0 and 
|x;| = M whenever X;a:;; > 0. Hence, for all 7 e S,, A,a;; = O except 
when 7 « S;, and thus the 7th station cannot receive directly or indirectly 
from the kth. Since S;, is nonempty, the kth station does not transmit 
directly or indirectly to all stations, Q.E.D. 

It follows from these theorems that the formal solution (16) is valid 
for the set of equations (15) if and only if there is at least one station 
that transmits directly or indirectly to all other stations. 

If there is no such station, the matrix of coefficients on the left side of 
(15) is singular, and the set of equations has either no solution or an 
infinity of solutions, depending on the values of the v; . Since a solution 
defines an equilibrium state in which all oscillators run at the same fre- 
quency, this means that the oscillators will run at the same frequency 
only if their free-running frequencies are appropriately adjusted; that is, 
the system is not self-synchronizing. . 

If there is only one station that transmits directly or indirectly to all 
others, that station is the master, setting the frequency for the whole 
system. A single master receives from no other station, since any station 
that transmitted to it would thereby transmit indirectly to all other 
stations. Thus, a station can become a master simply by the loss of all 
inputs from other stations. However, if two stations lose all their inputs, 
the system fails to synchronize, since neither station sends directly or 
indirectly to every other. 
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If more than one station sends directly or indirectly to all others, these 
stations are mutually synchronized, and jointly establish the system 
frequency. Any station that does not send directly or indirectly to every 
other station is in effect a slave station. 


V. SUMMARY AND CONCLUSIONS 


The process for initiation of synchronous operation described in Sec- 
tion III is not necessarily recommended as the best possible. It is in- 
tended as a constructive existence proof, showing that there exists a 
method of setting up a synchronized system of geographically separated 
clocks that will lead to a final frequency that can be determined in 
advance. The second part of that section shows, in perhaps unnecessary 
detail, that the behavior of a system, once it has been set up, can be 
determined without considering how it was set up, so that it is not neces- 
sary to specify the set-up procedure before studying its steady-state or 
dynamic behavior. 

Under these circumstances the equation for equilibrium frequency 
developed in Section IV plays no part in the process of setting the system 
in synchronism and adjusting it to run at the desired frequency. It 
serves to identify the factors that affect the final frequency and indicate 
the quantitative effect of each factor, and as such would appear to find 
its greatest usefulness in the design and control of the configuration of 
system interconnections. 

The connectivity condition evolved in Section IV permits the inclusion 
of single-master systems in the same general class as completely mutually 
synchronized systems. It is suggested that these two types are in fact 
opposite extremes of a more general class in which the most useful con- 
figurations may have some intermediate hierarchical form. 
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Intermodulation Noise in FM Systems 
Due to Transmission Deviations and 


AM r PM Conversion* 


By T. G. CROSS 
(Manuscript received August 2, 1966) 


Two noise contributors in FM systems are: (t) intermodulation noise 
due to transmission deviations; and (11) intermodulation noise due to trans- 
mission deviations and AM/PM conversion, designated AM /PM inter- 
modulation noise. Expressions for the second- and third-order AM /PM 
intermodulation noise are derived in terms of transmission medium coeffi- 
cients and a continuous pre-emphasis characteristic, with the unpre-em- 
phasized baseband signal being simulated by white Gaussian noise. These 
expressions have been programmed on a digital computer and representative 
noise responses and properties of AM/PM intermodulation noise were 
obtained. General properties and characteristics for the two noise contribu- 
tors are documented in parallel for comparative purposes. It was found that 
AM/PM intermodulation noise can be a significant noise contributor in 
FM systems. 


I. INTRODUCTION 


Intermodulation noise is produced whenever a phase modulated signal 
is passed through a linear transmission medium whose amplitude and 
phase characteristics are nonlinear functions of frequency. The output 
signal from this medium is both envelope and phase modulated, with the 
phase modulation being a distorted replica of the input phase function. 
The envelope modulation and phase modulation functions are similar in 
that both consist of first (linear), second-, third-, and higher-order func- 
tions of the input phase function. They differ in that the coefficients of 
the terms making up the two modulating functions are related in dif- 
ferent ways to the transmission medium characteristic. 

The distortion terms higher than first order, in the output phase 


* Portions of this paper were presented at the 1966 IEEE International Com- 
munications Conference in Philadelphia, Pa., June 16, 1966. 
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modulating function, produce intermodulation noise. This source of 
noise has been the subject of much work over the past ten to twenty 
years. The envelope distortion terms directly produce no degrading 
effects in linear systems. However, when the linear transmission medium 
is followed by a device that converts envelope variations at its input to 
phase variations at its output then a different noise-generating mech- 
anism exists. This latter source of noise will be designated as ‘““AM/PM 
intermodulation noise” to distinguish it from the intermodulation noise 
produced directly by transmission deviations.* The two phenomena are 
illustrated in Fig. 1 which depicts the two-step process involved in the 


v(t) Ww) Vy (t) DEGREES Vo (t) 
Y (w@ —— 

INTERMODULATION dB INTERMODULATION NOISE 
NOISE DUE TO DUE TO TRANSMISSION 
TRANSMISSION DEVIATIONS AND AM/PM 
DEVIATIONS INTERMODULATION NOISE 
t|wet to (t) 

v(t)=ExP [wet +9 (t)] 


v, (t)= exp alt) exp "lect +o (+)] 
Vo (t) = expai (t) expe oct +¥o (t) +ka (t)] 


WHERE 
@(t)= PHASE MODULATING FUNCTION DUE TO MULTICHANNEL SIGNAL 
Polt) = P(t) + PHASE DISTORTION TERMS 


k =01516 K=PHASE MODULATION INDEX IN RADIANS DIVIDED BY 
THE AMPLITUDE MODULATION INDEX 
DEGREES 


K=AM/PM CONVERSION CONSTANT MEASURED IN 
ASSUMING a(t) <<1 


a,(t)# a(t) IN GENERAL 
Y (W)= TRANSMISSION MEDIUM WITH TRANSMISSION DEVIATIONS 


Fig. 1— Model illustrating sources of intermodulation noise due to transmission 
deviations and AM/PM conversion. 


AM/PM intermodulation noise generation. The AM/PM converter 
will be characterized by the constant K which has the dimension of 
degrees/dB and can be interpreted as the peak phase change at the out- 
put for a 1-dB change in envelope at the input. In reality, this K may be 
a function of a number of quantities, e.g., carrier drive power, frequency 
(carrier and/or baseband), bias levels, or may even be complex. However, 
many presently developed broadband radio systems use TWT amplifiers 
as power output tubes which are often the major source of AM/PM 
conversion within a radio repeater. These tubes, when driven at moder- 

* Transmission deviations are defined as any deviation in the gain and phase 


characteristics from the ideal characteristics of constant gain and linear phase for 
all frequency components of the FM wave. 


NOISE IN FM SYSTEMS 1751 


ate, essentially constant input power level and biased from well controlled 
sources, are adequately characterized for small envelope fluctuations by 
a constant K degrees/dB.! 

Both noise phenomena, are of prime interest in frequency modulated 
systems. Intermodulation noise due to transmission deviations is of 
interest because it is a recognized significant noise source. AM/PM 
intermodulation noise is of interest because of the basic lack of knowledge 
which has existed on this subject. Due to this deficiency, the AM/PM 
phenomenon has become the underlying scapegoat for many system 
problems that appear to be unexplainable using existing system knowl- 
edge. 

The purpose of this paper is two-fold: (z) to present the mathematical 
development and ensuing solution for the problem of AM/PM inter- 
modulation noise in FM systems; and (zz) to provide enough general 
information about the two noise contributors considered in this paper 
such that one can analyze a system’s performance and/or set system 
requirements with some degree of confidence without having to neces- 
sarily utilize the associated digital computer programs. 

The analysis to follow considers a linear transmission medium, with 
generalized transmission deviations, followed by an AM/PM converting 
device. The baseband signal is simulated by a Gaussian distributed band 
of noise with flat power density spectrum which is pre-emphasized by a 
continuous pre-emphasis function before the FM process. The end result 
of the treatment is the signal-to-noise ratio for second- and third-order 
AM/PM intermodulation noise. The mathematical framework for this 
paper is derived from a recent paper which treated the subject of inter- 
modulation noise due to an imperfect transmission medium.? Certain 
facets of that work will be included here for the sake of continuity. 


II. THEORY FOR AM/ PM INTERMODULATION NOISE 


2.1 General Development 


Consider the system model shown in Fig. 1 where an FM signal is put 
into a linear transmission medium followed by an AM/PM converting 
device. The transfer function of the transmission medium is 


Y(w) = exp [—ew) — BW)] (1) 


and the impulse response is 


g(a) = = iz OM OT (2) 
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The FM signal input to the transmission medium is 


v(t) = exp {led + v(d)]} (3) 
and the output signal is 
nit) = exp [a(t)] exp {ile + go(é)]}, (4) 


where w, is the carrier frequency and ¢(é) is the phase modulating signal. 
Since Y(w) is a linear system, the input and output can be related by 


n(t) = [oft — 2)g(a) dx. (5) 

Substituting (3) and (4) in (5) gives 
exp [a(t)] exp [i¢.(t)] = iz exp [ip(¢ — 2) — tw,.a]g(x) dx. (6) 
The output function, ¢.(é), was the subject of a previous paper? and will 
not be considered further here. Our prime objective is to determine the 


envelope variation in terms of its functional relationship to the phase 
modulating signal, g(t). It follows from (6) that 


a(t) = Reln ‘x exp [te(t — x) — tw.ar]g(a) de. (7) 


It can be shown that? 


a(t) = —alf.) + muse! — Mg” + Meg” — MA gi 4 
lon 4 y sp yo er 12 ~ ls. 2 
se ee er? 3? a? + (8) 


lis ny Asi 7 Mar 4 
ae ¢g ied [r gn? ’ 


where the subscripts r and 7 denote the real and imaginary parts of the 
corresponding coefficients, and the prime notation indicates the deriva- 
tive with respect to time. The argument of the phase functions in (8) 
is ¢ — ta where ¢g is an arbitrary delay.2 The moments, m, , in (8) are 
related to the transmission medium by 


(=i) 
v= VGo. Fo 


and the 7 and X coefficients are defined as follows: 


Y (we + w) exp Cita) | a (9) 
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Lh = m — 2mm; — m2? + 2Zm2mMe 

lo = m3 — mMymMe 

lI, = mM, — MyM 

ls = Ms — me 

Aye = Mm — Mz? 

3 = M — 38mm. + 2m 

Ms = Mm, — 4Amynzg — 38m? + 12 mem. — Gmi'. 


As an example, we have 
hi = ma — 2mm — 2Zums, — 2m1i?me; 


since 71, = 0 (Appendix I of Ref. 2). 
For the following transmission medium 


Y(w + We) = [1 + giw + gow" + go" + gsc” 


N 
+ >> us cos (Pyw + 65)] exp co + bso” + daw" 
J=i (10) 


N 
+ De vy Sin (quw + «| 


the moments m, given by (9) have been evaluated and expressed in 
terms of the transmission deviations in Appendix I of Ref. 2. 

For the analysis to follow, the transmission deviations in (10) are 
limited to values typically encountered in broadband radio relay systems. 
However, the ripple type transmission deviations must have ripple 
periods greater than approximately twice the top baseband frequency. 
These restrictions are dictated by the limited number of terms of a(é) 
which are to be considered. 

Referring once again to Fig. 1, we see that when the output signal from 
Y(w) passes through the AM/PM converter the envelope perturbation 
given by a(é) is converted into a phase perturbation, given by k a(t). 
The k coefficient is related to K (degrees/dB) as follows: the envelope 
distortion term expressed in dB is 


20 log = dB = 8.686 a(t) dB 


so the phase distortion, due to envelope perturbations, after AM/PM 
conversion is 
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8.686K a(t) degrees = 0.1516 K a(é) radians. 
We will let 
k = 0.1516 K radians. 


Hence, the phase distortion function, due to envelope variations, after 
the AM/PM converting device is 


k a(é) radians, (11) 


where a(t) is given by (8). 

The analysis up to here has been perfectly general (except for the 
assumption that the AM/PM process may be represented by a constant 
factor). The terms in (8) consist of first- (linear), second-, third- and 
higher-order functions of the input phase modulating signal, g(t). The 
linear terms produce baseband amplitude distortion which we shall not 
concern ourselves with in this paper. Also, terms higher than third 
order will not be considered. This is not an undue restriction because 
the prime contributors of intermodulation type noise in broadband 
systems are second- and third-order phase distortion terms. Therefore, 
neglecting linear, fourth-, and higher-order terms in (8) gives* 


k a(t) =k E y'e” _ lor gl! = Aer 2 e ie gt | 


6 2 8 
) ‘ (12) 
li /2 » ets A3i 13 a 
+h [Moe “| radians 
Using the relationships 
ee tae s. 1 tt 
i’ 2¢'¢ 
d 12 . sue Qn” 
ane See + 29 
d on _ lo» 
ae 3¢-¢ 
in (12) gives 
k a(t) = 60(t) + 43;(t) = @,r(t) radians, (13) 


where 


*Tt should be noted that additional second- and third-order terms exist which 
are not shown in (8) nor included in (12). These additional terms are considered 
to be negligible for the transmission deviation constraints previously mentioned. 
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2 
Ox(t) = k |-™ Be eee ede Ok is |e (14) 


4dti 12d? 24 
with 
Lay = Als, = 3ls, (15) 
and 
=s = nat hi d gy’. 


In the Appendix it is shown that the second-order distortion, 62(é), 
and the third-order distortion, @3(£), are uncorrelated. Hence, the total 
AM/PM intermodulation noise power density spectrum, considering 
only second- and third-order distortions, is the sum of the two individual 
noise power density spectra. 


2.2 Second-Order Noise Power Density Spectrum 


In this section we will derive the equation for the second-order 
AM/PM intermodulation noise power density spectrum. The time repre- 
sentation for the second-order phase distortion due to AM/PM conver- 
sion was derived in the previous section and is 


= _ Aer lo, d L3y d 12 ls, y2 
62(t) = e| sake oe 2 bole +k Ex : (17) 


The terms in brackets are operators on their respective functions, so (17) 
can be represented by the block diagram shown in Fig. 2, 


where 
= Ls, 2 Aer . lo, 
Gila) E - a5 E o| (18) 


and 


p= (t) 





Fig. 2— Second-order noise block diagram. 
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1 1 
Z Galo) am ag [4 ase (19) 


It can easily be shown, using the relationship for the cross-correlation of 
linearly transformed random functions,’ that the power density spectrum 
of 62(t) is 


So,(w) = Gi(—w) Gilw) Sy2() + Gi(—w) Grlw) Syrry2() 
+ G2x(—w) Gi(w) Syrr2y2(w) + G2(—a) Ga(w) Syr2(w) 


where, for example, S,2,-2(w) is the cross-power density spectrum of 
y” (t) and y(t). As in Ref. 2, (20) can be expressed as 


So(w) = 2| G(w) / F[Re'(r)] + 2| Gr(w) P FR?) 
+ 2Gi(—&)G2(w) + G2(—w)Gi(w)] F[Ree (7); 


where Gi(w) and G2(w) are given by (18) and (19), respectively, and 5 
stands for the Fourier transform. 

Now, redefining the transfer functions given in (18) and (19) we can 
write 


(20) 


(21) 


580 (0) = 2/Gi(w) [ FlRe(r)] + 2| Gow) | FR (r)] 





(22) 
+ 21G1(—w)G2(w) + G2(—&)Gi(o)] F [Ree (r)] 
where now 
Gi(w) = E w — re a |= | (23) 
1 
Gow) = 94 lar. (24) 


Equation (22) is the second-order AM/PM intermodulation noise power 
density spectrum weighted by the AM/PM conversion parameter. The 
ability to pull the & out of the calculation provides great flexibility. 


2.3 Third-Order Noise Power Density Spectrum 


The time representation for the third-order phase distortion due to 
AM/PM conversion is, from (16), 


ae Asi, as A) ots 
6:(t) =k | - rc. + Bs) <| ge (25) 


which can be represented by the block diagram in Fig. 8 where 
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sakes 6. (t) 
7’ (t) Bus 3 


Fig. 3— Third-order noise block diagram. 


1 3; . Li; 
i G3(w) = | - Ma) 4 if is o|. (26) 


It follows that the third-order AM/PM intermodulation noise power 
density spectrum is 


Sig(0o) = | Gy(w) |? Sse). 
It can be shown that® 
Sps(w) = 6 F[Ry(r)] + IR’) Se), (27) 
which can be written 
Sps(w) = 65[Ry"(r)] (28) 


since 9 R,?(0) S,-(w) is a sealed power density spectrum of the input I-M 
signal and hence can be neglected since it does not contribute to the 
distortion.* Therefore, 


So,(w) = 6 | Gy(w) |? FLRo*(r)) (29) 


where G3(w) is given by (26). Redefining the transfer function we have 


= 81,(a) = 6| Go) P S(Ry?(7)] (30) 


Galo) = | — | + | He], (31) 


Hence, (80) gives the third-order AM/PM intermodulation noise power 
density spectrum weighted by the AM/PM conversion parameter. 

A quantity of interest in engineering problems is the signal-to-noise 
ratio. Thus, we now characterize the simulated multichannel baseband 
signal. 


where now 


2.4 Pre-Emphasized Signal Power Density Spectrum 
The basic block diagram arrangement for a typical signal transmission 
path is shown in Fig. 4. The unpre-emphasized baseband signal is ob- 


* This term causes baseband amplitude distortion instead of intermodulation 
noise. 
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PRE- 
EMPHASIZED 


BASEBAND BASEBAND 
SIGNAL PRE-EMPHASIS| SIGNAL SIGNAL ey, 
NETWORK w) 
Yi (t) Pp (w) p(t) MGbUL ATER 
v(t)=cos - t+ (t)] 


Fig. 4— Typical signal transmission path. 


tained from the frequency division multiplex terminals, directly or via a 
transmission facility, and is pre-emphasized prior to being applied to a 
I'M modulator. The output of the FM modulator is consistent with v(t) 
shown in Tig. 1. Assume that the unpre-emphasized baseband signal has 
a Gaussian distribution and a flat power density spectrum, P, , between 
—f, and f,, where f, is the top baseband frequency. The output power 
density spectrum from the pre-emphasis network is 


Se(w) = Po[P@) 7, Ifl Sh (32) 
where P(w) is the transfer function of the pre-emphasis network. Letting 
| Pw)? = ao + af + aft + af’, [f| S he, (33) 
where the a’s are real constants, we have 
Sew) = Pola + af? + aft + af], |f| Sf. (84) 
It can easily be shown that? 
(270)? : 
z Ne 
of, (« + he 4 Uh ah 


where o = rms frequency deviation, in Hz, due to the baseband signal, 
and f, is in Hz. Equation (34) gives the power density spectrum of the 
pre-emphasized baseband signal in terms of the coefficients of a con- 
tinuous pre-emphasis characteristic, and in terms of system parameters, 


o and f, . 





Pi (rad/sec)’/Hz 


(35) 





2.5 Signal-to-Noise Ratio 


We are now in a position to express the signal-to-noise ratio for second- 
and third-order AM/PM intermodulation noise. The expressions given 
in (22) and (30) are for PM distortions so we convert them to FM dis- 
tortions by multiplying by w. Hence, the signal-to-noise ratios can be 
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expressed as 


Sy’ (w) 
sae | 7 — 20 log i 
10 log =~ | = 10 log 2 36 
[1008 350) | ao (Z) sae _ 
and 
Sy’ (w) 
ol > — 20 log 
10 log =5-,< | = 10 log 7 
| ' wo So, (w) grd tee (@) So, (w) iS 


where S,-(w) is given by (34), 1/k? S»,(w) is given by (22), and 1/k? 
So,(w) 1s given by (80). A digital computer program has been written 
which will evaluate (36) and (87) for any values of the transmission 
deviation coefficients, pre-emphasis coefficients, rms frequency deviation 
due to the baseband signal, top baseband frequency, and AM/PM con- 
version factor. The derivations of SLR, (r)], SR," (r)]; [Ry (r)], 
and 3$[i,’(r)] in a form applicable to a digital computer program are 
given in Appendix II of Ref. 2. 


III. NOISE PROPERTIES AND CHARACTERISTICS 


The previous material provided the mathematical treatment of 
AM/PM intermodulation noise. In this section we will document the 
various properties of both AM/PM intermodulation noise and inter- 
modulation noise due to transmission deviations.* Also, the charac- 
teristics of these two noise phenomena will be explored by utilizing a 
representative system model. Both noise contributors are treated in 
parallel throughout the section for comparison purposes. The results 
are presented in three discrete modes: (2) properties which are true in 
general; (77) properties which are approximately true; and (222) charac- 
teristics which are derived from a representative system model. The 
theoretical treatment previously presented was for a transmission 
medium given by (10). In this section we will confine our analysis to 
the power series transmission deviations in (10). This is done for two 
reasons: (7) the properties of the two noise phenomena can be concisely 
documented for power series transmission deviations; and (72) the gain 
and phase ripple properties need more analysis as well as mathematical 
treatment in order to fully characterize the effects of ripples in the 
transmission medium. 


*The information for this latter noise contributor was obtained from Ref. 2, 
which gives it implicitly, as well as from the associated digital computer program. 
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3.1 General Properties 


Equation (8) of this paper and (23) of Ref. 2 have been expressed 
in terms of the transmission deviations and tabulated as shown in Table 
I. This table is an extension of Table 21-1 of Ref. 4. 


TasLE IJ—AMPLITUDE AND PHASE MopuLATIoN CAUSED 
BY TRANSMISSION DEVIATIONS 


Type of transmission Resulting amplitude Resulting phase 











deviation modulation, a(é) modulation, go(t) — ¢(t) 
| 
Linear gain, gi nye’ — 29e"? + ignite’? 
Parabolic gain, gz | gop’? + 3929”? —goe"” + grep" 
Cubic gain, g3 —gse!"’ + gre” — 3939/9” 
Quartie gain, g4 —4gip'e!"’ — 391”? gag!" — Ogse"p” 
Parabolic phase, be | beep” + 2b22p’y’"’ + be2p”? —Fbo29''"’ + bop” 
aa Qborp/2y” 

Cubic phase, b; 3b39'¢" —b3p''’ + b3e"8 
Quartie phase, b, —big’""’ + Bbhavp’”2e” —Abyp'e!"’ — 3b”? 
Interaction terms —[gibs + gebele’’”’ —giboe’’’ + gigee'e” 

+ [29g:bele’e” + gigse’e’”" — 4(gibs + geba)y'y’”’ 

—ggoe'? + [891b3 + 4gob2 —3(gibs + gabs)e”? 
— 29)°b2|p"e” + (89193 — 91?g2)e"~” 














Input signal = exp {z[w-t + g(¢)]}; output signal = exp [a(t)] exp {ilwet + ¢(t)]}; 
transmission medium transfer function = Y(w + w.) = [1 + giw + gow? + gzw* 
+ gyw*t] exp {t[bow? + byw? + byw']}. 

The argument of all the amplitude and phase functions is ¢. 


The order of the noise produced by different transmission deviations 
(e.g., g:1, b2) are given in Table II for intermodulation noise due to 
transmission deviations and for AM/PM intermodulation noise. Two 
rules of thumb can be stated. For intermodulation noise duc to trans- 
mission deviations the rule is: 

Even-order gain and delay transmission deviations cause odd-order notse. 


TABLE II — OrpER oF NOISE 














Intermodulation noise 
Transmission deviation 
Due to transmission deviations Due to AM/PM conversion 

Linear gain (g1): No noise *Second and third 
Parabolic gain (g2): Third Second 

Cubic gain (g3): Second Third 

Quartic gain (g4): Third Second 

Linear delay (62): *Second and third Second 

Parabolic delay (bs): Third Second 

Cubic delay (b,): Second Third 











* Indicates predominant component of the two possible, 


NOISE IN FM SYSTEMS 1761 


Odd-order gain and delay transmission deviations cause even-order noise. 
For AM/PM intermodulation noise the rule is, for those transmission 
deviations that cause significant relative noise (will become apparent 
later), 

Even-order gain and delay transmission deviations cause even-order noise. 

Odd-order gain and delay transmission deviations cause odd-order notse. 

The two types of intermodulation noise are related to the magnitude 
of the transmission deviation coefficient by the relationships shown in 
Table IIT. Once a noise response is obtained for a particular system and 
transmission deviation coefficient value, then the system noise for any 
other coefficient value typically encountered in transmission systems 
can be easily predicted. 


3.2 Approximate Properties 


The variation in the top message channel noise, for both noise con- 
tributors, with number of channels, assuming the peak frequency devia- 
tion remains constant as the number of message channels increase, is 
shown in Table IV for the different transmission deviations. These 
approximate relationships yield results with an error of <1 dB for 
smooth pre-emphasis functions typically used in broadband radio 
systems. 

The assumptions used were that the peak frequency deviation re- 
mained constant, and that a typical frequency division multiplex plan 
was used. The rms frequency deviation, due to the baseband signal, 


Taste IJ] — VARIATION OF RELATIVE NOISE witTH TRANSMISSION 
DEVIATION COEFFICIENT VALUE 


Intermodulation noise 
Transmission deviation 








Due to transmission deviations Due to AM/PM conversion 
Linear gain (91): No noise *40 jog : gi’/g1 | , 60 log 
Parabolic gain (g2): 40 log | go’ /g2 | yi ‘log has ’/g2| (approxi- 
mation error <3 dB) 
Cubie gain (gz): 20 log | gs’/gs 20 log | g3’/gs 
Quartic gain (g,): 20 log | ga'/gs 20 log | ga’/ga 
Linear delay (62): *20 log is “Is [, 40 log | 40 log | b2’/be 
Parabolic delay (bs): 20 log | b3’/bs 20 log | b3’/bs 
Cubic delay (64): 20 log | ba’/b4 20 log | b4’/bs 














Where the prime (’) notation depicts the terminal value and the unprimed 
notation indicates the initial value. ; 
* Indicates predominant component of the two possible. 
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TABLE ITV — VARIATION OF Top CHANNEL NOISE witH NUMBER OF 
MeEssAGE CHANNELS 








Intermodulation noise 
Transmission deviation 
Due to transmission deviations D pas eM 

Linear gain (q1): No noise = 20 log N’/N 

Parabolic gain (gz): Relative top channel noise increase 
= 41 log N’/N | = 21 log N’/N 

Cubic gain (g3): Relative top channel noise increase 
= 39 log N’/N | & 21 log N’/N 

Quartic gain (g4): Relative top channel noise increase 
= 41 log N'/N | & 57 log N’/N 

Linear delay (b2): Relative top channel noise increase 
= 21 log N’/N | = 58 log N’/N 

Parabolic delay (63): | Relative top channel noise increase 
= 23 log N’/N | & 39 log N’/N 

Cubic delay (6,): Relative top channel noise increase 
=~ 58 log N’/N | = 40 log N’/N 














Where N’ = increased number of channels; NV = initial number of channels. 


was allowed to change, accordingly, as the number of message channels 
increased. 


3.3 Noise Characteristics 


3.3.1 Representative System Model 


As a system model, we will use the following system parameters: 
N = number of message channels = 1200 
fs = top baseband frequency = 5.772 MHz 
AF = peak frequency deviation = 4 MHz 
o = rms frequency deviation due to the multichannel baseband 
signal = 0.771 MHz. 
The pre-emphasis characteristic is shown in Fig. 5 and can be expressed 


by 
| P(w) ? = 0.9989 + 3.5839 X 107° f? 
—5.0245 X 10-* f* + 3.894 x 107° f6, 


where f is in MHz. 


3.3.2 Noise Response for the Individual Transmission Deviations 


It is instructive to show the individual noise responses on a compara- 
tive basis. This can be done by letting all gain transmission deviations 
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BASEBAND FREQUENCY IN MHz 


Fig. 5 — Pre-emphasis characteristic. 


have 1-dB distortion, relative to the carrier, at 10 MHz away from the 
carrier. Also, we let all delay transmission deviations have 1 nanosecond 
(ns) distortion, relative to the carrier, at 10 MHz away from the carrier. 
This allows us to directly compare the noise contributions of the different 
gain and phase transmission deviations, respectively, and also allows 
for some sort of pseudo comparison between a 1-dB gain distortion and 
a l-ns delay distortion. The intermodulation noise response, due to 
transmission deviations, for the different transmission deviations are 
shown in Fig. 6. Similarly, the AM/PM intermodulation noise responses 
are shown in Tig. 7. Note that the responses in ig. 7 are for k = 1.0 
radian or a 6.6 degrees/dB AM/PM conversion device. For any other 
value of k, say k;, we raise or lower the responses according to 20 log 
ky , as indicated by (36) and (87). 

It is interesting to note that linear delay is an important contributor 
to intermodulation noise, due to transmission deviations, but is not a 
significant AM/PM intermodulation noise contributor. Also, we observe 
that parabolic gain is a large relative contributor for AM/PM inter- 
modulation noise but is a negligible relative contributor for intermodula- 
tion noise due to transmission deviations. As a side point, we point out 
that parabolic gain is also a significant source of baseband amplitude 
distortion. 

The phase transmission deviation noise responses in Figs. 6 and 7 
are of particular interest because the values used in these two figures 
are realistic even for an equalized system; this is not the case for the 
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Fig. 6 —Intermodulation noise due to transmission deviations. 
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Fig. 7— AM/PM intermodulation noise (k = 1.0 radian). 
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gain transmission deviation values used so one need not be unduly 
alarmed at first inspection of the noise responses shown. However, the 
gain deviation noise responses are of interest in order to determine 
which types of gain deviations a particular noise source is most sensitive 
to. 

Comparison of Figs. 6 and 7 and Table II show that for all transmis- 
sion deviations of significance, the order of the noise distortions are 
different for the two noise phenomena. Coupling this with the results 
of the Appendix shows that for a given transmission deviation, the two 
noise responses are uncorrelated. 

Of great significance and importance is the parabolic delay AM/PM 
intermodulation noise response. We see from Fig. 7 that this particular 
delay deviation is, by far, the largest contributor of AM/PM inter- 
modulation noise compared to the other delay terms. The importance 
of this finding lies in the fact that TWT amplifiers, when used as output 
power tubes in broadband radio systems, are separated from transmitter 
modulators (used to go from IJ* to RF) by band pass filters which may 
possess large amounts of parabolic delay. Hence, we have large parabolic 
delay distortion prior to an important AM/PM conversion device. 
The noise impairment due to this typical system arrangement will be 
examined in a later section. . 

Another point of interest is the noise response for linear gain. We 
see that linear gain is not a significant AM/PM intermodulation noise 
contributor. This is useful knowledge because in the past system require- 
ments for linear gain have been set based on speculated AM/PM inter- 
modulation noise impairments, as well as on derivable baseband ampli- 
tude distortion due to linear gain and AM/PM conversion. 


3.3.3 Effects of Interaction Terms 


Referring back to Table I we note the row marked interaction terms. 
By the form of the terms involved it is obvious why they are so named. 
If one were to evaluate (22) and (30) in terms of the transmission devia- 
tions explicitly, he would find that over 80 percent of the terms are 
interaction terms. To examine the effects of these interaction terms we 
compare the response we would get if we combined the curves shown 
in Fig. 6, for example, on a power basis with the response we would 
obtain by using all the transmission deviations at once, 1.e., by taking 
into account the interaction terms. There are a large number of possi- 
bilities that could be examined, but to put the problem in perspective 
the analysis considered only the cases shown in Table V. The results 
for the two noise phenomena are shown in Figs. 8 and 9. The responses 
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TABLE V — CaSES CONSIDERED IN THE STUDY OF 
INTERACTION EFFECTS 


Case Condition* 





— 


Power addition of noise responses due to individual transmission devia- 
tions (all g’s and b’s positive) 

Noise response under the condition: g; negative and all other g’s and 
b’s positive 

Noise response under the condition: gz negative and all other g’s and 
b’s positive 

Noise response under the condition: g3 negative and all other g’s and 
b’s positive 

Noise response under the condition: g, negative and all other g’s and 
b’s positive 

Noise response under the condition: b2 negative and all other g’s and 
b’s positive 

Noise response under the condition: b3; negative and all other g’s and 
b’s positive 

Noise response under the condition: bs negative and all other g’s and 
b’s positive 

Noise response under the condition: all g’s and b’s positive 

Noise response under the condition: all g’s and b’s negative 





eon oan Oo oO FP WD WwW 


—_ 











Where | g: | = 1 dB at 10 MHz 
|g2| = 1 dB at 10 MHz 
|g: | = 1 dB at 10 MHz 
| gs| = 1 dB at 10 MHz 
| be | = 1 ns at 10 MHz 
|b; | = 1 ns at 10 MHz 
| bg | = 1 ns at 10 MHz 


Il 





* All the conditions take into account the effects of interaction terms except 
for case l. 


shown in Jig. 8 are rewarding from a systems analysis standpoint be- 
cause it indicates that the interaction components for intermodulation 
noise, due to transmission deviations, do not significantly perturb 
the noise response obtained by adding up the individual transmission 
deviation noise responses on a power basis. Hence, for this noise source, 
a system analyst could set requirements based on power addition of 
the individual noise responses and be fairly confident that the actual 
system noise response, due to transmission deviations, will be within 
a dB of that response. 

We sce from Fig. 9 that the above desirable property does not hold 
for AM/PM intermodulation noise. The responses shown in Fig. 9 
deviate significant amounts from the power addition response (ease 1) 
by mere shifts of signs, the greatest departures occurring for parabolic 
and quartic gain distortion which are, in their own right, the largest 
relative noise contributors as evident from Fig. 7. The relative tendencies 
indicated in Fig. 9 also occur when typical equalized repeater trans- 
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Fig. 8—Intermodulation noise due to transmission deviations — effects of in- 


teraction terms (refer to Table V for case listing). 
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Fig. 9— AM/PM intermodulation noise — effects of interaction terms (k = 


1.0 radian) (refer to Table V for case listing). 
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mission deviation values are used. In fact, least squares approximations 
on equalized repeaters, RF band pass filters, ete., yield values for para- 
bolic and quartic gain which are either, or both, negative, i.c., a loss 
with increasing frequency. Hence, even under practical situations one 
cannot, in general, expect power addition of the individual transmission 
deviation AM/PM intermodulation noise responses to yield representa- 
tive AM/PM intermodulation noise performance. 


3.3.4 Nowse Response for a Representative Radio System Repeater 


The results up to this point utilized representative system parameters, 
but normalized values for the transmission deviations were used. Of 
interest is the predicted noise response for a typical situation, i.e., making 
use of values typically encountered in practice. We will now use the 
representative gain and delay responses shown in Fig. 10 for an un- 
equalized and equalized radio repeater. The predicted intermodulation 
noise responses, due to transmission deviations, are shown in Fig. 11. 
It is obvious that the equalization has greatly improved the system’s 
noise response. 

To examine the AM/PM intermodulation noise we take note of the 
previously mentioned fact that the TWT has a band pass filter (whose 
gain and delay responses are given in Fig. 10) preceding it. The AM/PM 
intermodulation noise due to the band pass filter and the TWT amplifier 
(assuming 2.5 degrees/dB) is also shown in Fig. 11. 
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Fig. 10— Gain and delay characteristics. 
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Fig. 11— Representative radio system noise response. 


We see from Fig. 11 that the AM/PM intermodulation noise in the 
top channel is much larger than the intermodulation noise due to trans- 
mission deviations for an equalized repeater. The repeater equalizer 
is designed to correct for gain and delay shapes obtained from measure- 
ments which do not recognize the AM/PM conversion phenomenon. 
Hence, repeater equalizers based on such measurements, even though 
effective for reducing intermodulation noise due to transmission devia- 
tions, prove ineffective for AM/PM intermodulation noise which occurs 
as indicated. In other words, the system has an AM/PM intermodula- 
tion noise floor which is transparent to external gain and delay measure- 
ments. 

The transmission deviation of the band pass filter which is the major 
noise contributor is the parabolic delay term. Hence, to reduce the 
AM/PM intermodulation noise one must devise some method of cor- 
recting for this transmission deviation. Two means of equalizing the 
band pass filter are: (2) pre-equalization at IF prior to up-converting 
in the transmitter modulator; and (22) microwave equalization directly 
before or after the bandpass filter. The first method may not yield 
perfect correction because up-converters using varactor diodes possess 
AM/PM conversion characteristics, in some cases 1 degree/dB. In 
essence, it would effectively be like trading noise due to 2.5 degrees/dB 
for noise due to 1.0 degrees/dB or an 8-dB improvement in the ideal 
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case, i.e., nO compression in the up-converter and a perfect inverse 
band pass filter characteristic. However, an improvement anywhere 
near this value would greatly reduce the effects of AM/PM conversion. 


IV. CONCLUSIONS 


Two noise contributors in IM systems are: (z) intermodulation noise 
due to transmission deviations; and (zz) intermodulation noise due to 
transmission deviations and AM/PM conversion. This latter source 
of noise is designated ‘““AM/PM intermodulation noise” in this paper. 
Analysis was carried out in order to predict the second- and third-order 
AM/PM intermodulation noise for the transmission medium given in 
(10) and a continuous pre-emphasis function. Flat Gaussian noise was 
used to simulate the unpre-emphasized baseband signal so the results 
are consistent with the laboratory system tests using ‘noise loading”’. 
Expressions were derived which specify the signal-to-noise ratio in 
terms of system parameters, transmission deviations, pre-emphasis 
characteristics and AM/PM conversion parameter. The latter param- 
eter, assumed to be a real constant, was separated from the body of the 
calculations so that the resulting noise responses could be easily altered 
for any value of AM/PM conversion. 

The paper presented general noise properties and characteristics for 
the two noise contributors. This material was presented in parallel, 
for the two noise contributors, for comparison purposes. The order of 
the noise component for different transmission deviations was given 
so that one would know if a given transmission deviation causes second- 
or third-order noise. The variation of the relative noise with transmission 
deviation coefficient value was given so that a system analyst can deter- 
mine the relative detriment to a system response that would result from 
a change in a given transmission deviation. Another useful result was 
the variation of top channel noise with number of message channels. 
This would be of use in the case where one is interested in increasing a 
system’s message channel capacity. 

Noise responses were given using a representative radio system model. 
It was found when all gain transmission deviations had the same distor- 
tion and when all delay transmission deviations had the same distortion 
that: (2) for intermodulation noise due to transmission deviations the 
cubic and quartic gain terms created the greatest top channel noise 
due to gain transmission deviations, and that linear delay created the 
greatest top channel noise due to delay transmission deviations; and 
(it) for AM/PM intermodulation noise the parabolic and quartic gain 
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terms created the greatest top channel noise due to gain transmission 
deviations, and that parabolic delay created the greatest top channel 
noise due to delay transmission deviations. The effects of interaction 
terms were examined. It was found that interaction terms do not signifi- 
cantly perturb the noise response from that of the case of power addition 
of the individual noise responses for intermodulation noise due to trans- 
mission deviations. However, this desirable property did not hold for 
AM/PM intermodulation noise which says that power addition of the 
individual noise responses may be in gross error; in other words, inter- 
action terms must be considered when evaluating AM/PM intermodula- 
tion noise. 

The intermodulation noise due to both noise contributors was pre- 
dicted for a representative radio system repeater. It was observed that 
the AM/PM intermodulation noise due to the band pass filter preceding 
the TWT amplifier created more top channel noise than that due solely 
to the equalized transmission characteristic, i.e., intermodulation noise 
due to transmission deviations. Possible correction methods were given. 

A point of interest, is that the two noise contributors considered in 
this paper are correlated so that combining the two spectra together 
assuming random addition, i.e., power addition, is not sufficient in 
general. The significance of this correlation is presently being examined 
and will be reported on in a later paper. 
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APPENDIX 


Uncorrelated Second- and Third-Order Distortions 


We will show here that the second-order distortion, @:(t), and the 
third-order distortion, 63(¢), are uncorrelated. Consider 


Or(t) = 62(¢) + 43(Z). (38) 
Now the autocorrelation function of @7(¢) is 


Rop(t) = Ro(r) + Ro,(7) + Raye,(7) + Rese, (r) (39) 


1772 THE BELL SYSTEM TECHNICAL JOURNAL, DECEMBER 1966 


where, e.g., 2o,0,(7) is the cross-correlation function of 6:(¢) and @3(t). 
Taking the Fourier transform of (39) gives 


Sop(w) = So(w) -- So;(@) + So,0;,(@) + So,0,(@). (40) 
From Fig. 2, we have 
O2(t) = x(t) + y(d) (41) 
so it follows that 
Re0,(7) = Rzo(7) + Ryo,(r). (42) 


Referring to Figs. 2 and 3 we have, using the relationship for the cross- 
correlation of linearly transformed random functions,? 


Se,0,(w) = Gi(—w) Gs(w) S,r2,13(w) + Ge(—w) Gs(w) Syr2,73(w). (43) 
Now we can write 
Syrga(w) = F[Ry23(7)] = F lave (o%"%)] (44) 
and 
Syrr2g'3(@) = F[Ryvaa(r)] = F [ave (6%"%)], (45) 


where & stands for the Fourier Transform. 

The phase modulating signal, g(¢) represents the multichannel message 
load and so for a large number of talkers g(t) is Gaussian with zero 
mean. It follows that derivatives of ¢(¢) are Gaussian with zero mean. 
It can be shown that® 


n 
ave [a1'! +++ 2n'"] = 0, >> r: odd (46) 
t=1 
where 21 °-:* %, are Gaussian random variables with zero mean, and 
f*++ Tn are any set of integers. Hence, letting 
1 = ¢’ 
2 = g’ 


in (44), and letting 


in (45) gives, using (46), 


NOISE IN FM SYSTEMS 1773 


Similarly, 


Se,0.(w) = 0. 


Hence, 62(é) and 6;(¢) are uncorrelated so 


So7(w) = So,(w) + Se, (w). 
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Synthesis of Band-Limited Orthogonal 
Signals for Multichannel Data 
Transmission 


By ROBERT W. CHANG 
(Manuscript received August 4, 1966) 


This paper presents a principle of orthogonal multiplexing for trans- 
mitting a number of data messages simultaneously through a linear band- 
limited transmission medium at a maximum data rate without interchannel 
and intersymbol interferences. A general method is given for synthesizing 
an infinite number of classes of band-limited orthogonal time functions in 
a limited frequency band. Stated in practical terms, the method permits the 
synthesis of a large class of practical transmitting filter characteristics for 
an arbitrarily given amplitude characteristic of the transmission medium. 
Rectangular-shaped ideal filters are not required. The synthesis procedure 
zs convenient. Furthermore, the amplitude and the phase characteristics 
of the transmitting filters can be synthesized independently. Adaptive correla- 
tion reception can be used for data processing, since the received signals 
remain orthogonal no matter what the phase distortion ts in the transmission 
medium. The system provides the same signal distance protection against 
channel notses as if the signals of each channel were transmitted through 
an independent medium and intersymbol interference in each channel were 
eliminated by reducing data rate. 


I. INTRODUCTION 


In data transmission, it is common practice to operate a number of 
AM data channels through a single band-limited transmission medium. 
The system designer is faced with the problem of maximizing the overall 
data rate, and minimizing interchannel and intersymbol interferences. 
In certain applications, the channels may operate on equally spaced 
center frequencies and transmit at the same data rate, and the signaling 
intervals of different channels can be synchronized. For these applica- 
tions, orthogonal multiplexing techniques can be considered. Several 
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orthogonal-multiplexed systems developed” in the past use special 
sets of time-limited orthogonal signals. These signals have widely 
spread frequency spectra (e.g., a sin «/x spectrum). Consequently, 
when these signals are transmitted through a band-limited transmission 
medium at a data rate equivalent to that proposed in this paper, certain 
portions of the signal spectrum will be cut off and interferences will 
take place. For instance, the interferences because of band-limitation 
have been computed’ for a system using time-limited orthogonal sine 
and cosine functions. 

This paper shows that by using a new class of band-limited orthogonal 
signals, the AM channels can transmit through a linear band-limited 
transmission medium at a maximum possible data rate without inter- 
channel and intersymbol interferences. A general method is given for 
synthesizing an infinite number of classes of band-limited orthogonal 
time functions in a limited frequency band. This method permits one to 
synthesize a large class of transmitting filter characteristics for arbi- 
trarily given amplitude and phase characteristics of the transmission 
medium. The synthesis procedure is convenient. Furthermore, the 
amplitude and the phase characteristics of the transmitting filters can 
be synthesized independently, i.e., the amplitude characteristics need 
not be altered when the phase characteristics are changed, and vice 
versa. The system can be used to transmit not only binary digits (as 
in Ref. 1) or m-ary digits (as in Ref. 2), but also real numbers, such as 
time samples of analog information sources. As will be shown, the system 
satisfies the following requirements. 

(¢) The transmitting filters have gradual cutoff amplitude charac- 
teristics. Perpendicular cutoffs and linear phases are not required. 

(it) The data rate per channel is 2f, bauds,* where f, is the center 
frequency difference between two adjacent channels. Overall data rate 
of the system is [V/(N + 1)] Rinax , where N is the total number of 
channels and Rmax , Which equals two times the overall baseband band- 
width, is the Nyquist rate for which unrealizable rectangular filters 
with perpendicular cutoffs and linear phases are required. Thus, as NV 
increases, the overall data rate of the system approaches the theoretical 
maximum rate Rmax , yet rectangular filtering is not required. 

(iat) When transmitting filters are designed for an arbitrary given 
amplitude characteristic of the transmission medium, the received 
signals remain orthogonal for all phase characteristics of the transmission 
medium. Thus, the system (orthogonal transmission plus adaptive 


* The speed in bauds is equal to the number of signal digits transmitted in one 
second. 
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correlation reception) eliminates interchannel and intersymbol inter- 
ferences for all phase characteristics of the transmission medium. 

(iv) The distance in signal space between any two sets of received 
signals is the same as if the signals of each AM channel were transmitted 
through an independent medium and intersymbol interference in each 
channel were eliminated by reducing data rate. The same distance 
protection is therefore provided against channel noises (impulse and 
Gaussian noise). For instance, for band-limited white Gaussian noise, 
the receiver receives each of the overlapping signals with the same 
probability of error as if only that signal were transmitted. The distances 
in signal space are also independent of the phase characteristics of the 
transmitting filters and the transmission medium. 

(v) When signaling intervals of different channels are not syn- 
chronized, at least half of the channels can transmit simultaneously 
without interchannel or intersymbol interference. 


II. ORTHOGONAL MULTIPLEXING USING BAND-LIMITED SIGNALS 


Consider N AM data channels sharing a single linear transmission 
medium which has an impulse response h(é) and a transfer function 
H(f) exp [Jn(f)] (see Fig. 1).* H(f) and n(f) will be referred to, respec- 
tively, as the amplitude and the phase characteristic of the transmission 
medium. 

Since this analysis treats only transmission media having linear 
properties, the question of performance on real channels subject to such 
impairments as nonlinear distortion and carrier frequency offset is not 
considered here. Such considerations are subjects of studies beyond the 
scope of the present paper. 

In deriving the following results, it is not necessary to assume that 
the transmitting filters and data processors operate in baseband. How- 
ever, this assumption will be made since in practice signal shaping and 
data processing are usually performed in baseband. Carrier modulation 
and demodulation (included in the transmission medium) can be per- 
formed by standard techniques and need not be specified here. 

Consider a single channel first (say, the 7th channel). Let bo, bi, 
be, --+ , be a sequence of m-ary (m = 2) signal digits or a sequence of 
real numbers to be transmitted over the 7th channel. As is well known,* 
bo, b1, bg, +--+: can be assumed to be represented by impulses with 
proportional heights. These impulses are applied to the zth transmitting 
filter at the rate of one impulse every 7 seconds (data rate per channel 


* J denotes the imaginary number ~/—1, while 7 is used as an index. 
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Fig. 1— N data channels transmitting over one transmission medium. 


equals 1/7’ bauds). Let a;,(t) be the impulse response of the 7th trans- 
mitting filter, then the 7th transmitting filter transmits a sequence of 
signals as 

bo-ai(t) + bi-a(t — T) + bea, — 2T) + ---. 
The received signals at the output of the transmission medium are 


where 
u(t) = [ h(t — r)a,(r) dr. 


These received signals overlap in time, but they are orthogonal if 


oe] 


| UREA WS): BLASS CH 


As is well known, orthogonal signals can be separated at the receiver 
by correlation techniques;* hence, intersymbol interference in the 7th 
channel can be eliminated if (1) is satisfied. 

Next consider interchannel interference. Let co, c:, c2, +--+ be the 


* Correlation reception and its adaptive feature will be briefly discussed in 
Appendix C. 
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m-ary signal digits or real numbers transmitted over the jth channel 
which has impulse response a;(é). It has been assumed in Section I that 
the channels transmit at the same data rate and that the signaling inter- 
vals of different channels are synchronized, hence the jth transmitting 
filter transmits a sequence of signals, 


Co: a;(t) + C1: a,;(t = T) + C2 a;(t = 22) + eee, 
The received signals at the output of the transmission medium are 
couj(t) + q-u;(t — T) + ce-u,;(t — 27) +---. 


These received signals overlap with the received signals of the ith 
channel, but they are mutually orthogonal (no interchannel interference) 
if 


/ u(t)u,(t — kT) dt = 0, k = 0, +1, +2,---. (2) 


Thus, intersymbol and interchannel interferences can be simultane- 
ously eliminated if the transmitting filters can be designed (i.e., if the 
transmitted signals can be designed) such that (1) is satisfied for all 7 
and (2) is satisfied for all 7 andj (¢ ¥ 7). 

Denote U;(f) exp [Ju,(f)] as the Fourier transform of u,;(t). One can 
rewrite (1) as 


[ U2(f) exp (—J2afkT) df = 0 


k = +41, 42,--- (3) 
t= 1,2; 7 p aN 
and rewrite (2) as 
[Ud exp Lu USD) exp (—Sus(J)) 
-exp [—J2afkT] df = 0 
k = 0, +1, +2, --- (4) 
1,) = 1,2,°** ,N 
tA. 


Let A;(f) exp [Ja;(f)] be the Fourier transform of a,(t). The transfer 
function of the transmission medium is H(f) exp [Jn(f)]. Equation (8) 
becomes 
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a Ai (f)H'(f) exp (—J2afkT) df = 0 


ae ee aoe (5) 
i= Lei ety ING 


or 


” AZG)EE(S) cos QafhT df = 0 


k = 1,2,3, ++ (6) 
bs 1, 28+ 


Equation (4) becomes 
[ADA MNEEWD exp (laf) — a,(f) — 2afk TV} df = 0 


ee Vie ae are 
459) = 1; 2, pia 
2S 


(7) 


Writing (7) in real and imaginary parts and comparing parts for k = 
1,2,--- andk = —1, —2, --- , it is seen that (7) holds if and only if 


[ “ AADA/H(S) cos [ar(f) — aj(f)] cos 2afkT df = 0, (8) 


and 
[ AdNAMHHD) sin lau(f) — ai(f)] sin QafkT af = 0, (9) 
where 
k = 0,1, 2,--: 
tJ = 1, 2, pias ,N 
t FF 9. 


It will be recalled that the transmitting filters and the data processors 
operate in baseband. Let f; ,7 = 1,2, ---- , N, denote the equally spaced 
baseband center frequencies of the N independent channels. One can 
choose 


fi = (h + a)fe, (10) 
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where h is any positive integer (including zero), and f, is the difference 
between center frequencies of two adjacent channels. Thus, 


Je fe = DiS ee aaa (11) 


Carrier modulation will translate the baseband signals to a given fre- 
quency band for transmission. 
Each AM data channel transmits at the data rate 2f, bauds. Hence, 


1 
deh ie 2, 
T= Of seconds. (12) 


For a given amplitude characteristic H(f) of the transmission medium, 
band-limited transmitting filters can be designed (i.e., band-limited 
transmitted signals can be designed) such that (6), (8), (9), and (12) 
are simultaneously satisfied (no intersymbol and interchannel inter- 
ference for a data rate of 2f, bauds per channel). In addition, the five 
requirements in Section I are also satisfied. A general method of design- 
ing these transmitting filters is given in the following theorem. 


Theorem: For a given H(f), let A;(f),7 = 1, 2, --- , N, be shaped such that 
APOM YG) = Crt OG) oO: jer af Sirs 
=O ARS fey fear les 


where C; is an arbitrary constant and Q;(f) is a shaping function having 
odd symmetries about f; + (f5/2) and f; — (f./2), 2.e., 


af (+4) 4] = -0.[ (04 fr) |, o<s <%, 4) 
olle-Dot]--ale-D-rp vcr ow 


Furthermore, the function [C; + Q:(f)]- [Cia + Qii(/)] ts an even function 
about f; + (f:/2), 7.e., 


o. + Q; (+4 ple ag ‘| & tO (f +24 r') | 
: Lo: a (i 3 5-1) | on aa (i +3 et) (16) 


7 


(13) 


0<f< 
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Let the phase characteristic a;(f), 7 = 1,2, --- ,.N, be shaped such that 


a) =a = 5 tu). Sea Peters 
(17) 


t= 1,2,--°-,N —-1, 


where y:(f) is an arbitrary phase function with odd symmetry about f; + 
(fs/2). 

If A:(f) and a;(f) are shaped as in (18) through (17) and fi is set ac- 
cording to (10), then (6), (8), (9), and (12) are simultaneously satisfied 
(no intersymbol or interchannel interference for a data rate of 2f, bauds per 
channel). Furthermore, the five requirements in Section I are also satisfied. 

The proof of this theorem will be broken down into two parts. The 
first part [showing that (6), (8), (9), and (12) are simultaneously satis- 
fied] will be given in Appendix A. The second part (showing that the 
five requirements in Section I are satisfied) will be given in Section III 
following a discussion of the various choices of the shaping functions 
and transmitting filter characteristics. 


III. TRANSMITTING FILTER CHARACTERISTICS 


Consider first the shaping of the amplitude characteristics A;(f) of 
the transmitting filters. Equations (13), (14), and (15) in the theorem 
can be easily satisfied. Equation (16) can be satisfied in many ways. 
For instance, a simple, practical way to satisfy (16) is stated in the fol- 
lowing corollary. 


Corollary 1: Under the simplifying condition that 
(t) C; should be the same for all 2 
(iz) Q:(f), 7 = 1, 2, ---- , N, should be cdentically shaped, 1.e., 
Qis1(f) = Q:(f — fe), 1= iF a Myr pv = 1, (18) 
(16) holds when Q;(f) is an even function about f; , 7.e., 


Geb Ff) = Gite — 7), Of. as (19) 


The proof of this corollary is straightforward and need not be given here. 
Two examples are given for illustration purpose. The first example is 
illustrated in Fig. 2 where Q;(f) is chosen to be 


‘Meee 


7 ’ a le Sy ee ee ty eae ie 





i 
Q(f) = 5 COS 
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Fig. 2— First example of shaping the amplitude characteristic Ai(f) of the 
transmitting filters. 


This simple choice satisfies (14), (15), (18), and (19). Let C; be 4 for all 
7, then (14), (15), and (16) are all satisfied. From (18) 


AP(f/)H(f) = Ci + Q:(f) 


MBs i 
Je. = 





1 1 
= 5 1 5 C08 ® 
and 


RUG): cone! aa pape pepe 





This A,(f)H(f) is similar to the amplitude characteristic of a standard 
duobinary filter (except shift in center frequency). The second example 
is illustrated in Fig. 3 where Q;(f) is chosen such that A;(f)H(f) has a 
shape similar to that of a multiple tuned circuit. It can be seen from 
these two examples that there is a great deal of freedom in choosing 


Qi (F) A? (f) H? (F) Ai(f) H (F) 
i 
2 1 { 
0 
ale 
0 0 
fi- fy Fi fit fs fi-fs fe fitfs fi- fs fi Fitts 


Fig. 3— Second example of shaping the amplitude characteristic Ai(f) of the 
transmitting filters. 
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the shaping function Q,(f). Consequently, A;(f)H(f) can be easily shaped 
into various standard forms. A,(f) would have the same shape as 
A,(f)H(f), if H(f) is flat in the frequency band f; — f, to f; + fs of the 
ith channel. If H(f) is not flat in this individual band, A,(f) can be ob- 
tained from A,(f)H(f)/H(f), provided that H(f) # 0 for any f in the 
band. 

It is also noted from the preceding that if C; is chosen to be the same 
for all 7 and if Q,(f),7 = 1,2, --- , N, are chosen to be identically shaped 
(i.e., identical in shape except shifts in center frequencies), then 
A(f)H(f),7 = 1,2, --- , N, will also be identically shaped. Consequently, 
A.(f), 7 = 1, 2, --- , N, will be identically shaped if H(f) is flat or is 
made flat. An advantage of having identically shaped filter characteristics 
is that each filter can be realized by using an identical shaping filter plus 
frequency translation. 

H(f) can be made flat by using a single compensating network which 
compensates the variation of H(f) over the entire band. As an alternative, 
note that A;(f) exists only from f; — f, to f; + f,. Hence, for the 7th 
receiver, the integration limits of (6), (8), and (9) can be changed to 
f: — f, and f; + f,. Therefore, the signal at the 7th receiver only has to 
satisfy the theorem in the limited frequency band f; — f, to f; + f.. 
This permits one to design the transmitting filters for flat H(f) and then 
compensate the variation of H(f) individually at the receivers, i.e., use 
an individual network at the zth receiver to compensate only for the 
variation of H(f) in the limited frequency band f; — f, to f; + fs . 

Finally, note that if the channels are narrow, each channel will usually 
be approximately flat. In these cases, one may design the transmitting 
filters for flat H(f) without using compensating networks. This design 
should lead to only small distortion. 

Consider next the shaping of the phase characteristics a,(f) of the 
transmitting filters. It is only required in the theorem that (17) be 
satisfied. However, if it is desired to have identically shaped transmitting 
filter characteristics, one may consider a simple method such as that in 
the following corollary. 


Corollary 2: Under the simplifying condition that a;(f),7 = 1, 2,--- , N, 
be identically shaped, 1.e., 


ot41(f) = a(f ts te); t= 1, 2, oar vil et (20) 


(17) holds when 
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a(f) = har’ 5 + go + >» Gm COS Mar 


Ss 
fs 








i ee © 
f, 





+ >0 y, sin nr 
(21) 
m = 1, 2, 3,4, 5, --- 


n = 2,4,6,--- 
Pen S oP Ee Spe es 


where h is an arbitrary odd integer and the other coefficients (go 3 @m , M = 
1, 2,3, 4,5, +--+ 3%, = 2,4, 6, --+ ) can all be chosen arbitrarily. 

This corollary is proven in Appendix B. Note that if the index n in the 
corollary were not required to be even, a;(f) would be completely arbi- 
trary (a Fourier series with arbitrary coefficients). This shows that there 
is a great deal of freedom in shaping a;(f) even if the additional constraint 
of identical shaping is introduced (three-fourths of the Fourier coefficients 
can be chosen arbitrarily). The linear term ha[(f — f;)/2f.] is introduced 
not only to give the term +72 in (17), but also because a linear com- 
ponent is usually present in filter phase characteristics. A simple example 
is given in Irig. 4 to illustrate (21). For clarity, the arbitrary [Fourier 
coefficients are all set to zero, except Y2, and h is set to —1. 

As can be seen in the theorem, the requirement on a;(f) is independent 
of the requirements on A,(f). Hence, the amplitude and the phase char- 
acteristics of the transmitting filters can be synthesized independently. 
This gives even more freedom in designing the transmitting filters. 

A simple set of A,(f) and a;(f) is sketched in Fig. 5 for three adjacent 
channels. This illustrates that the frequency spectrum of each channel 
is limited and overlaps only with that of the adjacent channel. H(f) is 
assumed flat and the transmitting filters are identically shaped. As 
mentioned previously, these filters can be realized either by different 
networks or simply by using identical shaping filters plus frequency 
translations. 

Now consider the five requirements in Section I. The first requirement 
is satisfied since the transmitting filters designed are of standard forms 
(see the examples in Iigs. 2 and 3). Perpendicular cutoffs and linear 
phase characteristics are not required. 

As for the second requirement, it can be seen from Fig. 5(a) that the 
overall baseband bandwidth of N channels is (NV + 1)f, . Since data rate 
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Fig. 4— An example illustrating (21) (h = —1, y 2 ¥ 0, all other coefficients 
set to zero for clarity). 
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Fig. 5— Example of transmitting filter characteristics for orthogonal multi- 
plexing data transmission. 
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per channel is 2f, bauds, the overall data rate of N channels is 2f,N 
bauds. Hence, ; 


2N 
N+1 


_ N 

 N+1 
where R,.x , Which equals two times overall baseband bandwidth, is the 
Nyquist rate for which unrealizable filters with perpendicular cutoffs 
and linear phases are required. Thus, for moderate values of N, the 
overall data rate of the orthogonal multiplexing data transmission 
system is close to the Nyquist rate, yet rectangular filtering is not re- 
quired. This satisfies requirement (72). 

Now consider the third requirement. As has been shown, the received 
pulses are orthogonal if (6), (8), and (9) are simultaneously satisfied. 
Note that the phase characteristic n(f) of the transmission medium does 
not enter into these equations. Hence, the received signals will remain 
orthogonal for all 7(f), and adaptive correlation reception (see Appendix 
C) can be used no matter what the phase distortion is in the transmission 
medium. Also note that so far as each receiver is concerned, the phase 
characteristics of the networks in each receiver (including the bandpass 
filter at the input of each receiver) can be considered as part of 7(f), 
and hence has no effect on the orthogonality of the received signals. 

In the case of the fourth requirement, let 


by, k= 0,1,2,---; ¢= 1,2,---,N, 


overall data rate = X overall baseband bandwidth 





Fines 5] 


and 
co, k= 0,1,2,---; ¢*@=1,2,---,M 


be two arbitrary distinct sets of m-ary signal digits or real numbers to be 
transmitted by the VN AM channels. The distance in signal space between 
the two sets of received signals 


DD bi’ -us(t — kT) 
4 k 

and 
» > cy u(t — kT) 
4 k 

is 


a=|f [XE Lota — er) — LO Dohwlt — eryy at | 


1 
2 
. 
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In an ideal case where interchannel and intersymbol interferences are 
eliminated by transmitting the signals of each channel through an inde- 
pendent medium and slowing down data rate such that the received 
signals in each channel do not overlap, the distance d can be written as 


didenl = b a | (bf Se a = rat) 
a k — OO 


In this study, the N channels transmit over the same transmission 
medium at the maximum data rate T = 1/2f, . If the transmitting filters 
were not properly designed, the distance d could be much less than 
diaea, and the system would be much more vulnerable to channel noises 
(impulse and Gaussian noises). However, if the transmitting filters are 
designed in accordance with the theorem in Section II, the received 
signals will be orthogonal and d = digca;. Thus, the distance between 
any two sets of received signals is preserved and the same distance pro- 
tection is provided against channel noises. For instance, since d = dideai , 
it follows from maximum likelihood detection principle that for band- 
limited white Gaussian noise and m-ary transmission the receiver will 
receive each of the overlapping signals with the same probability of error 
as if only that signal is transmitted. 

Note further that diaca: can be written as 


daa = [OE Oo — os)? [ accorcnar]. 


Thus, diaca1 18 independent of the phase characteristics a;(f) of the trans- 
mitting filters and the phase characteristic y(f) of the transmission 
medium. Since d = diaea:, it follows that d is also independent of a;(f) 
and n(f) and the same distance protection is provided against channel 
noises for all a;(f) and n(f). 

Finally, consider the fifth requirement. It is assumed in this paper that 
signaling intervals of different channels are synchronized. However, it is 
interesting to point out that the frequency spectra of alternate chan- 
nels (for instance, 7 = 1,3,5, ---) do not overlap (see Fig. 5). Hence, if 
one uses only the odd- or the even-numbered channels, one can transmit 
without interchannel and intersymbol interferences and without syn- 
chronization among signaling intervals of different channels.* The 
overall data rate becomes 3 Rmax for all N. A very attractive feature is 
obtained in that the transmitting filters may now have arbitrary phase 
~ * For instance, signal digits are applied to the 7th transmitting filter at 0, T, 


PERS i aa while signal digits are applied to the (¢ + 2)th transmitting filter at 
7,0 +7,2T + 7,387 +7,°°: , where 7 is an unknown constant. 
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characteristics a,(f). [This is because a,(f) is not involved in (6) and 
intersymbol interference is eliminated for all a;(f).] Thus, only the ampli- 
tude characteristics of the transmitting filters need to be designed as in 
the theorem and the transmitting filters can be implemented very easily. 

Another case of interest is where part of the channels are synchronized. 
As asimple example, assume that there are five channels and that channel 
1 is synchronized with channel 2; channel 4 is synchronized with channel 
5; while channel 3 cannot be synchronized with other channels. If the 
amplitude characteristics of the five channels plus the phase character- 
istics of channels 1, 2, 4, and 5 are designed as in the theorem, one can 
transmit simultaneously through channels, 1, 2, 4, and 5 or simultane- 
ously through channels 1, 3, and 5 without interchannel or intersymbol 
interferences. The overall data rate is then between $Rmax and 


CN GN Ae ) Binass 


IV. CONCLUSION 


This paper presents a principle of orthogonal multiplexing for trans- 
mitting N(N 2 2) AM data channels simultaneously through a linear 
band-limited transmission medium. The channels operate on equally 
spaced center frequencies and transmit at the same data rate with 
signaling intervals synchronized. Each channel can transmit binary 
digits, m-ary digits, or real numbers. By limiting and stacking the fre- 
quency spectrums of the channels in a proper manner, an overall data 
rate of 


2N 
N+1 


is obtained which approaches the Nyquist rate when JN is large. Inter- 
channel and intersymbol interferences are eliminated by a new method 
of synthesizing the transmitting filter characteristics (j.e., designing 
band-limited orthogonal signals). The method permits one to synthesize 
a large class of transmitting filter characteristics in a very convenient 
manner. The amplitude and the phase characteristics can be synthesized 
independently. The transmitting filter characteristics obtained are 
practical in that 
(¢) The amplitude characteristics may have gradual rolloffs, and the 
phase characteristics need not be linear. 
(iz) The transmitting filters may be identically shaped and can be 
realized simply by identical shaping filters plus frequency transla- 
tions. 


X overall baseband bandwidth bauds 
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It is noted that the principle presented in this paper uses band-limited 
orthogonal signals as opposed to other orthogonal multiplexing schemes 
using nonband-limited orthogonal signals. The chief advantage of using 
band-limited signals is that (as mentioned in Section I) these signals can 
be transmitted through a band-limited transmission medium at a maxi- 
mum data rate without interchannel and intersymbol interferences. 
Other advantages of using band-limited signals over methods using 
nonband-limited signals are 

(_) Permitting the use of a narrowband bandpass filter at the input 
of each receiver (see Appendix C) to reject noises and signals 
outside the band of interest. This is particularly important in 
suppressing impulse noises and in preventing overloading the 
front ends of the receivers. 

(7) Permitting unsynchronized operations at data rates between 

$Rmax and (V/N + 1)Rmax - 

It has been shown that the received signals remain orthogonal for all 
phase characteristics of the transmission medium; hence, adaptive cor- 
relation reception can be used to separate the received signals no matter 
what the phase distortion is in the transmission medium. These correla- 
tors adapt not only to the phase distortions in the system (including 
transmission medium, bandpass receiving filters, etc.), but also (see 
Appendix C) to the phase difference between modulation and demodula- 
tion carriers (easing synchronization requirements). 
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APPENDIX A 


In this appendix, it will be proven that if the transmitting filters 
A.(f) exp [Ja.(f)], 7 = 1, 2, --- , N, are shaped as in the theorem in 
Section II and f; is set according to (10), then equations (6), (8), (9), 
and (12) are simultaneously satisfied. 

First consider (6). From (13) 


ee) fits 
i Ai(f)H’(f) cos 2afkT df = he IC; + Q:(f)] cos 2xfkT df. (22) 


Since 7 = 1/2f, , one has 
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fitts C. 
/ C; cos 2afkT df = — = [sin 2r(f; + f,)kT 
fits QarkT 


— sin 2r(f; — f,)kT| 


ee oe (23) 
= oop sin ak cos 2rf:kT 
= 0, k = 1, 2,3, --- 
t= 1,2,3,---,N. 
Since f; = (h +7 — $)f,, one has 
‘) 1 
Qr (F —"2) kT = Qr(h +7 — 1)f,.k = 
2 2fs (24) 


(h + a — 1)kr. 


Hence, cos 2afkT is an even function about f; — (f,/2). This, together 
with the fact that Q:(f) is an odd function about f; — (f,/2) [see (15)], 
gives 


Si 
/ Q:(f) cos 2afkT df = 0, k = 1,2,3,--- 
SiS s 


(25) 
7=1,2,---,N. 
Similarly, one can show 
Lith s 
i Q:(f) cos 2xfkT df = 0, ke = 1.259. es 
7 (26) 


ae eee 
Substituting (23), (25), and (26) into (22) gives 


is Ai (f)H'(f) cos 2afkT df = Q, k= j, 2, 3, Sts 


i= 1,2,3,-::,N. 


Thus, (6) is satisfied and intersymbol interference is eliminated. 
Next consider interchannel interference and (8) and (9). From (13), 


SO 
A(PA MHA) = 0 for j =t4+ 2,74 3,74 4,---, 
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or 


[ “APMORS cik) a lester a =o 


[ ACNAA NECN sin laf) — a4(f)] sin 2afkT df = 0 
(27) 
k = 0,1,2,--- 


ae eee 
Pe Dg Ste og Sie At lus as, 


Equation (27) shows that (8) and (9) are satisfied for 7 = 7 + 2,7 + 3, 
41+4,---.Itremains to show that (8) and (9) hold for 7 = 7 + 1. Con- 
sider 7 = 7 + 1. It is seen from (13) that 


APAm(NHf = (Ce + QAHCu + QuaNh, fi<f <fi tf 
= 0, P< Sey ode We Jax (28) 
One can write from (17) and (28) 


[ "Ad f)Aaa(fYFP(S) cos lax(f) — ovsa(f)] cos QafhT af 


Lists 4 ‘ 7 
= ih (C2 + QA 1Cur + Qii(f)]’ cos | 3 ss w(d) | 


(29) 
-cos 2afkT df 


k = 0,1, 2,--- 
a= 1,2,---,N. 
It is required in the theorem that 
[Ce + Q(T Cir + Qin) 


be an even function about f; + (f,/2). Furthermore, cos [-(a/2) + 
vi(f)| and cos 2afkT are, respectively, odd and even functions about 
fit (f./2). Hence, from (29) 


‘a Ai(f)Ana(f)H'(f) cos [ai(f) — oiss(f)] cos 2afkT df = 0 


k = 0,1,2,--- (30) 
Pe, Died Ne 
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Equation (80) shows that (8) is satisfied for 7 = 7 + 1. In a similar 
manner, one can show that (8) holds for7 = 7 — 1 and that (9) holds 
for7 = 71+ 1. These, together with (27), prove that (8) and (9) hold for 
all k, 7, and j. 


APPENDIX B 
Proof of Corollary 2 
From (20) and (21) 


arf) = ailf — fs) 
pe PII 


2fs 
+ Lem cos mel = 
oF Lin sin not aot (31) 
m = 1,2,3,4,5,--- 
n = 2, 4,6, -:- 


fi of < fi + fr. 
For f; < f < fi +f., one has from (21) and (81) 


St eae fan Ges Pee | 


h 
a;(f) = aisa(f) = Of, 





+ > Om | cos hig = f fi 


— cos pe Mess Leet fe at 





2 didn sin eee ites F. fi 


‘cot =H 
fs 


— sin nr 
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=" + Deal - 2 sin £(m Pa ah) sin | 


oe Dv. [2 sin“ cos &(n ptt f)| 


fs 
(32) 
=F 2D sine sin mf = Bhs — Ff) 
2f 
LT = 1,3,5,-:: 
h= +1, +3, eee 


Since sin [lr(2f — 2f; — fs)/2f.] is an odd function about f = f; + (f./2), 


(82) is equivalent to (17) and corollary 2 is proven. 


APPENDIX C 


This appendix briefly describes a possible receiver structure for receiv- 
ing the multichannel orthogonal signals. 

The receiver of a single channel (say, the fifth channel) is shown in 
Tig. 6(a). When viewed at point B toward the transmitter, the channels 
have amplitude characteristics as shown in Fig. 6(b). The bandpass 
filter at the input of the fifth receiver has a passband from fs — f, to 
fs + fe [Fig. 6(c)]. This filter serves the important purpose of rejecting 
noises and signals outside the band of interest. Sharp impulse noises with 
broad frequency spectra are greatly attenuated by this filter. Signals in 
other channels are rejected to prevent overloading and cross modulation. 

The product device translates the frequency spectra further toward 
the origin so that the signal can be represented by a minimum number of 
accurate time samples and the adaptive correlator can operate in digital 
fashion. The transmitter can transmit a reference frequency f, or a known 
multiple of f; to the receivers for deriving the signals cos [27(¢ — 1)f.¢ + 
6,;] for the product devices. It is important to note that the transmitter 
can lock this frequency f, to the data rate 2f, so that the arbitrary phase 
angle @; is time invariant and can be taken into account by adaptive cor- 
relation. Furthermore, the receiver can also derive the sampling rate 
2f, from this reference frequency. 

When observed at point D, the channels have amplitude characteristics 
as shown in Fig. 6(d). Note that the fifth channel now has a center 
frequency at 1.5f, [satisfying (11)] and an undistorted amplitude char- 
acteristic; hence, the signals in channel 5 remain orthogonal. The over- 
lapping frequency spectra between channel 5 and channels 4 and 6 
remain undistorted, and the phase differences au(f) — as(f) and as(f) — 
as(f) are unchanged; therefore, the signals in channels 4 and 6 remain 
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TRANSMISSION |_8_| BANDPASS LOWPASS ADAPTIVE (a) 
MEDIUM FILTER FILTER CORRELATOR 
cos [2m(t-1)fst+0i], i=s 
(b) 
(c) 
f 
fe) fe 
4 5 6 
(d) 
f 
fe) 1.5, 


Fig. 6 — Reception of the signals in channel 5. 


orthogonal to those in channel 5. Other channels produce no interference 
since their spectra do not overlap with that of channel 5. 

Let bou(t), biu(t — T), bou(t — 27), --- be the signals in channel 5 at 
point D, where bo, bi, b2, --+ are the information digits. These signals 
can be represented by vectors of time samples as 


boto ) buy 3 bette ee 


Since u(t), u(é — T), --- differ only in time origin, it is only necessary to 
learn uo for correlation purposes. The received signal at point D can be 
written as 


De bin + 2, 


where v represents the sum of the signals in other channels. From discus- 
sions in the preceding paragraph 
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Un'U; = 2X k=j 
=0 k#¥j 
uk v = 0. 


Thus, the adaptive correlator can learn the vector uo prior to data trans- 
mission and then correlate the received signal with uz, k = 0, 1, 2, 
to obtain the information digits b, , k = 0, 1, 2, 

In order to describe the operation more clearly we assume that the 
signal at point D is fed to a delay line tapped at 7'/3-second intervals 
(signal at D is band-limited between 0 and 3f,). Assuming that w(t) 
is essentially time-limited to mT seconds for all possible phase character- 
istics of the transmission system, then 3m taps are sufficient. The 7th 
tap is connected to a gain control G; . In the training period prior to data 
transmission, the 7th tap is also connected to a sampler s; . In the training 
period, the transmitter transmits a series of identical test pulses at ¢ = 0, 


IT’, 21T, --- . The integer / is chosen large enough such that the received 
test pulses u(é), u(é — IT), u(t — 21T), --- do not overlap. The sampler 
s; samples at ¢ = 7,17’ + 7, 2IT + 7, ---. The only requirement on r is 


that u(t) should be approximately centered on the tapped delay line at 
t = r. The output of s; (without noise) is a series of samples each repre- 
senting the 7th time sample wu; of u(t). Since noise is always present, these 
samples are passed through a network (probably a simple RC circuit) 
such that the output @; of this network is an estimate of u;-@; is in the 
form of a voltage or current and hence can be used to set the gain control 
G,; of the 7th tap. Thus, at the end of the training period, the gain con- 
trols of the successive taps are set according to the magnitudes of the 
successive time samples of u(t). 

During data transmission, the transmitter transmits the information 


digits bo , b1, bs, «++ sequentially at ¢ = 0, 7, 27, ---. A sampler at 
the receiver samples the sum of the outputs of all the tap gain controls at 
t=7,T +7,2T +7, --- to recover bo, bi, be, --- . The time delay 7 


remains the same as in the training period. The data transmission oper- 
ates in real time. 
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Avalanche Region of IMPATT Diodes 


By H. K. GUMMEL and D. L. SCHARFETTER 
(Manuscript received August 10, 1966) 


The avalanche region of an IMPATT (IMPact ionization Avalanche 
Transit Time) diode under small signal conditions ts characterized by the 
fraction of the total alternating current that is carried by holes and electrons 
in their respective drift spaces and by a residual impedance. The current 
fractions are roughly in phase with the total current below, and nearly 180° 
out of phase above a resonance frequency that 1s proportional to the square 
root of direct current density. 

This paper extends the calculations of Gilden and Hines for the current 
fraction to include phase shifts in the avalanche region so that extended 
avalanche regions can be considered. Realistic values (a # 8 in St) for the 
tonization coefficients are used. Results of detailed numerical calculations 
for the current fractions as a function of frequency and direct current den- 
sity are presented. 

For typical frequencies and current densities, the residual impedance 1s 
negligible and hole and electron current fractions are equal. The avalanche . 
region at a given frequency and current density 1s then characterized by 
one complex number and the admittance of a diode containing the avalanche 
region and adjacent drift regions is easily calculated. Plots showing the 
admittance as a function of frequency and current density for typical struc- 
tures are given. 

It ts found that an optimal exponential growth rate of oscillations is ob- 
tained when the current density 1s such that the resonance frequency is about 
equal to one half the reciprocal transit time through the longest drift region. 
If the assumption 1s made that conditions giving the largest small-signal 
exponential growth rate give the best large-signal performance, an optimum 
Read-diode design is obtained for which the avalanche region width is a 
substantial fraction (= 4) of the drift region width. 


I. INTRODUCTION 


This paper considers the avalanche region of IMPATT* diodes,! 73 
especially of Read!-type diodes in which the avalanche region is localized. 


* [MPact ionization Avalanche Transit Time. 
1797 
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Recent theoretical* and experimental® results for the impedance of Read 
diodes show considerable structure in the current and frequency depend- 
ence. This paper attempts to enhance the understanding of the small- 
signal negative resistance of IMPATT diodes by isolating the role of the 
avalanche region. Using the calculated parameters by which the ava- 
lanche region is characterized, the admittance for typical IMPATT 
diodes is calculated and exponential growth rate for oscillations is stud- 
ied. An application of the results to the design of Read diode oscillators 
is made. 

At a given angular frequency w and ac terminal current density 
Itot through the diode, the avalanche region can be characterized by 
three complex numbers: Z,, I, F. defined below. In Fig. 1, to the 
right of the avalanche region the small-signal particle current consists 
of a plane wave of holes. If we assume a constant drift velocity, then 
the magnitude is constant and the phase changes linearly with distance. 
Let Ax be the complex amplitude, extrapolated to a phase reference 
plane at 2» in the avalanche region. Then we define the hole current 
fraction F, 


F, = An/itot - (1) 


Similarly, we define the amplitude of the electron current density, 
extrapolated to a, to be A, and we define the electron current fraction 
F, 


Ff, — Ail tees (2) 
I DRIFT DRIFT 
END: te». SEMIN cee ae eee __.) END 
REGION K ae seg ‘| REGION 
| L R 
| AVALANCHE | | 
| — REGION = | 


ELECTRONS 
<_— 





PHASE 
REFERENCE 
PLANE 


Fig. 1 — Schematic of IMPATT diode. 
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In the limit as the frequency goes to zero, all the current through the 
device is particle current and fF, and F, are of equal magnitude. The 
phase reference plane is chosen so that at low frequencies /;, and F, 
are of equal phase also. The phase reference plane is approximately 
located at the point where the hole and electron currents are equal. 
Assume for a moment that the avalanche region were infinitely thin. 
From the fact that the total current equals the particle current plus 
displacement current we find that the ac field per total ac current, 
to the left of xo is 
Eo 1—t/iss 1 — F, exp [tw(x — 2x)/v.] (3) 
= : GN Ss oe ee ee ge oF 


ttot UWE 1WE 





where 2, is the electron current density and v, the electron velocity. 
The impedance, Zz between x, and 2 (see Fig. 1) is obtained by 
integration of (3). 





Zn = [1 — 7, Lap tere) |, (4) 

tw 1 IWTL 

where 7, is the transit time between zp and x, 
ee to — Uy (5) 


Ve 


and Cz, is the geometric capacitance per unit area, 


C, = a (6) 


fi — ae 





A corresponding expression obtains for Zor , the impedance between 
XY and vp. 

In deriving (4) it was assumed that the electron current is of the 
form 


1, = F, exp [iw(a — 2X0) /v¢l. (7) 
This is true only outside of the avalanche region. The difference between 
the actual and asymptotic hole and electron current densities gives rise 


to an additional, residual impedance Z,. The total impedance, exclu- 
sive of end-region resistances, is then 


Zio. = Zr + Zio + Zor. (8) 


Typically, the residual impedance term is small in comparison to the 
other terms. Also, for narrow structures and at sufficiently low fre- 
quencies such that v/w is large compared to the width of the avalanche 
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region, the hole and electron current fractions are equal in value and 
will be denoted by F without subscript. 

In Section II we consider, following Gilden and Hines,° the conse- 
quences of a simplified model of the avalanche zone, as embodied in 
Read’s (13), (Ref. 1) and derive an expression for Ff from it 


1 


p22): (9) 


Here the quantity f. = w,/2, which we shall call the critical frequency, 
is real, independent of operating frequency w/2zm and is proportional to 
the square root of the de current density. 

Section III presents a treatment that is rigorous except for the 
idealizing assumption that hole and electron velocities are constant 
and equal in value. Numerical results for several structures are given 
in Section IV, and it will be shown that for frequencies f and direct 
current densities of interest / can be represented as 


(if — B) AA 
"= Cp AG — AD (44), no 


where B is a zero on the real axis and is nearly independent of current 
density and A and A are complex conjugate poles that traverse a para- 
bolic path in the complex frequency plane as the current density is 
changed. 

In Section V a qualitative description of the change in negative 
resistance characteristics is given as a transition is made from p-i-n 
to p-n to “Read” diode. Admittance as function of frequency and cur- 
rent density 1s shown for two specific structures. Values for the expo- 
nential growth rate factor g(= — 1/2Q), maximized with respect to fre- 
quency, are shown as a function of current density. It is found that an 
optimal growth rate is obtained when the current density is such that 
the resonance frequency is about equal to one half the reciprocal transit 
time through the longest drift region. 

In Section VI we explore the consequences of the assumption that 
best large signal performance is obtained for the same conditions that 
yield largest small signal growth rate. This assumption in conjunction 
with a constraint expressing drift-region output limitations leads to 
Read-diode designs in which the width of the avalanche region is a 
substantial fraction (~ 4) of the drift region. 


F= 
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II. SIMPLIFIED MODEL 


The current fraction / can be deduced’ from the simplified theory 
of Read.’ His (13) for a thin avalanche region 


m1d10 _ i eS 
> ah = 1 adx i)+l. (11) 
states that the time derivative of the particle current density through 
the avalanche region, J) , equals the product of Jo and a field dependent 


actor h(E) 
h(E) = >({ ae 1) (12) 


if the saturation current J, can be neglected, as is usually the case. 
Under de conditions h(Z/) = 0. If 2, and e represent small signal ac 
components at frequency w of particle current density and electric 
field in the avalanche region, then 

iaip = Te he. (13) 
The total alternating current density ?.4 equals the sum of particle 
current density and displacement current density, 


ttot = OF + 1WEE (14) 
or, with the definition 
Petal ees (15) 
eee Gen oe (16) 
tot WWE 


Division of (13) by zo1 and substitution of e from (16) yields 





1 
R= ( w ; (17) 
1 -, 
g Werit 
with 
oh 
Werit = Qaferit = Io a / ¢ (18) 


Equations (17) and (18) predict a pole in the impedance or a zero 
in the admittance at a critical frequency ferit that is proportional to the 
square root of the current density. Experimental measurements’ and 
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numerical calculations’ have shown that in p-n diodes and p-i-n diodes 
the admittance goes through a minimum, but not through zero, at 
frequencies approximately proportional to the square root of current 
density. This indicates that the simplified theory based on (11) is 
roughly valid but not accurate in detail. In the next Section a treatment 
is presented that does not make the assumptions inherent in (11) and 
that is, therefore, applicable to wide as well as narrow avalanche re- 
gions. 


III. IMPROVED MODEL 


In this section we give a more rigorous treatment of the avalanche 
zone. The present treatment is facilitated greatly by the idealizing 
assumption that hole and electron velocities are constant and equal in 
value. If this assumption is not made the problem is still tractable and 
a method of solution was obtained.® However, it is felt that a somewhat 
idealized treatment with the attendant reduction in complexity and, 
hopefully, gain in physical insight, is worthwhile. This is especially the 
case since experimental values for particle velocity vs electric field are 
available only for prebreakdown fields. Though electron velocities in 
silicon, the material of present greatest interest, can be extrapolated 
reasonably well into the breakdown region, considerable uncertainty 
prevails about hole velocities. However, since in silicon the electrons 
ionize much more strongly (roughly a factor of 10 more) than holes, it 
is felt that the results of calculations for the current fractions F are not 
affected significantly by the choice of hole velocity. Thus, the assump- 
tion that the hole velocity equals the electron velocity appears adequate. 
The value of 10’ cm/sec is used for the velocity v. 

For the ionization coefficients a and @ of holes and electrons as func- 
tion of electric field # the expressions 


1.8 X 10’ (em™) exp (—3.2 X 10° (V/em)/E) 
B = 2.4 X 10° (em) exp (—1.6 X 10° (V/em)/E) 


are used. The numerical values refer to silicon at room temperature 
and are based on the work of Lee et al.” 

Let J, and J, be the hole and electron current densities. Then the 
continuity equations state 


si n= —I + al, t Ble (19) 
10 , 
Ie = Ie + aln + Ble (20) 


v dt 
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where primes denote spatial derivatives. Here we assume that holes 
move in the +<2 direction and electrons in the —2 direction correspond- 
ing to a positive electric field as in an n-p structure. We introduce the 
total electric current associated with particle motion 


Is=Ie+TIh, (21) 
and the difference in currents 
Lhe Dye Tis (22) 
Addition of (19) and (20) yields 
1a 
~ giz — da + (8+ az + (8 —a)Ia. (23) 
From Poisson’s equation 
Ih = —veE’ + No, (24) 


where Jf is the electric field and gNp the net impurity space charge. 
We shall now again consider small ac quantities at frequency w, de- 
noted by lower case symbols, superimposed on de quantities and we 
denote by ttct the total ac current. From the continuity of total current 
we obtain 


Trot = tz + twee. (25) 
We define 
k = tw/v (26) 


and normalize the ac electric field to the total alternating current 


eg (27) 
ttot 
Then (25) can be written 
ty 
— = 1 — kz. (28) 
Lot 


We introduce the derivative with respect to field of the last two terms 
of (23) 


1 0 0 
t= 1) eZ o+a+he@-a)| 


: (29) 
Then the small signal ac version of (23) is 


—2"+(a+p—k) — kz) —- (@-—a)e +He=0 (30) 
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or 
[ID —- kh + (B@+ta)kk+ B@-—a)D-—He=at+B-—k, (81) 


where D is the spatial derivative operator, and where a, 8, and H are 
evaluated for the average, or de, field as a function of distance. 

Equation (31) is a second-order differential equation for the complex 
quantity z. We postulate that to the left of the avalanche zone all 
particle current is electron current, 1.e., 


ty = plane wave moving to left (32) 


and that to the right of the avalanche zone all particle current is hole 
current 


iz = plane wave moving to right. (33) 


We do not specify that the magnitude of the ac electron current on the 
left equals that of the hole current on the right. Since we differ in this 
point from previous treatments, some discussion may be warranted. 

Holes and electrons are generated in pairs. One might, therefore, 
be led to conclude that the magnitudes of hole and electron particle 
currents must be equal. This is true for the de or average currents but 
not generally so for the ac currents. Consider that a periodic generation 
rate g(x) is given. We allow g to be complex to represent the variation 
in phase with distance. The continuity equations (19) and (20) for 
the small signal case may be written 


k+ D)t = 9 (34) 
(k — D)i. = g. (35) 


With g considered given, these are first-order differential equations for 
7, and 7,. With the boundary conditions (32) and (33) the solutions are 


in() = | 9(s) exp [-ba — 8)]ds (36) 


is(x) = [° o(s) exp [+a — 8)] ds (37) 


Since k is purely imaginary and g is complex, it is seen that 7,(v,) and 
ie(x,) need not be the same. For narrow structures and at low fre- 
quencies, the difference in magnitude is negligible. This, however, is 
the result of the calculations and not imposed as a constraint. 

We specify, then, that outside the avalanche region the particle 
current consists of a plane wave moving to the left or right. 
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= = Ap exp (—kz) right (38) 
tot 
a = A, exp (+k2). left (39) 
tot 


The amplitudes Az and A, are as yet undetermined. By (28) the 
electric field outside the avalanche region is 
oe rl — Ap exp (—kz)] right (40) 
Be ar 2 Aken es) ich. 27) 
and we obtain the boundary conditions 
zg =1—ke right (42) 
2 = —(1 — kz) left (48) 


in accordance with (22) and (24). 

These boundary conditions specify completely the solution of (31). 
If the solution for z is carried over the entire depletion region, then the 
integral over z gives the total impedance, exclusive of resistive losses 
in the end regions. However, in order to isolate the role of the ava- 
lanche region and to bring the impedance into the form (8) we proceed 
as follows: If a solution for z has been obtained at some sufficiently 
low frequency, then the magnitudes of Ar and A, in (88) and (389) 
are equal. A value x» exists at which the right hand side of (88) equals 
the right hand side of (89); 1.e., at which the asymptotic particle cur- 
rents, extrapolated into the avalanche zone, are equal in phase as well 
as magnitude. We call xz» the phase reference point and define the 
extrapolated particle currents at this point as the current fractions 
F, and F;, ; 


F, = Ar exp (—kao) (44) 
F.= Az,exp (+ka). (45) 
We define the asymptotic particle current 2, , 


F, exp [—k (a — 20)] xz Xo 
ta/teot = (46) 
F, exp [+k(« — 2)] x < 2X 


and the residual particle current 7, as the difference between the actual 
particle current 7,.:(1 — kz) and the asymptotic particle current 
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Va 





1 
tot tot 


(47) 


Note that by construction 7, vanishes outside the avalanche zone and 
is thus independent of the dimensions of the drift zones, as are I’, and 
F, . We can now solve (47) for z, 


IV 


F,, exp [—k(x — 2o)] 1 a 
1-—- (48) 
F. exp [+k(a — 20)] ae Gin 


The integral of z, multiplied by 1/ve = k/ (twe) is the total impedance 


2 





io Xo = Ly 1 =i —exp oan) 
1w€ LWT 1 
| (49) 
eS a = p, $—ep Gere) | 
LWE WTR 
= Z, + Zio + Zor, (50) 
with 

Fees ae (51) 

WWE Ltot 


We shall use the symbol F without subscript to refer to either Ff, or 
F’,, if their difference is negligible or the distinction unimportant. 


IV. NUMERICAL RESULTS 


This section contains numerical results for the following structures in 

silicon: 

D3 error function complement p-diffusion from a surface con- 
centration of 10” em into a 3 X 10'° cm” n-type substrate 
with a junction depth of 3 microns. 

D12_ similar to D3, but with a 12-micron junction depth. 

L22 a linearly graded junction with a concentration gradient of 
10” impurities/cem*. 

CO.1 Constant field avalanche zone (field = 5.71 X 10° volts/cm) 
of 0.1-micron width, surrounded by regions of sufficiently low 
field that negligible avalanching takes place there. 

Cl Similar to C0.1, but with 1-micron width; field = 3.57 X 10° 
volts/em. 

C5 Similar to C0.1, but with 5-micron width; field = 2.79 X 10° 
volts/cm. 
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C10 Similar to C0.1, but with 10-micron width; field = 2.52 X 10° 

volts/cm. 

For each desired direct current density J: , self-consistent values of 
de electric field /, difference current density J, , ionization coefficients 
a and 8, and derivative quantity H, (29), were computed as described 
in Appendix A. Using these quantities, the differential equation (31), 
subject to boundary conditions (42) and (43), was solved numerically 
for k-values corresponding to frequencies of interest. 

Figs. 2 and 3 show the results for the quantity 1 — 1/F = (—dis- 
placement current/particle current) for direct current densities of 100 
and 1000 amps/em*. According to the simplified model of Section II, 
1 — 1/F should have a real part varying as the square of frequency 
and a vanishing imaginary part. As Figs. 2 and 3 show, the square 
law is obeyed quite well at low frequencies and current densities. Only 
for the wide structures and/or at high frequencies and current densities 
does F’ deviate from the square law and do F,, and F’, deviate from each 
other appreciably. The frequency ferit at which (1 — 1/F’)reai 18 unity 
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Fig. 2— Real part of 1 — 1/F as function of frequency for various structures. 
Current density 100 amps/cm’. 
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Fig. 3— Real part of 1 — 1/F as function of frequency for various structures. 
Current density 1000 amps/cm’. 


is in good agreement with values obtained from the simple theory of 
Section IT. 

The absolute value of the imaginary part of 1 — 1/F at a current 
density of 100 amp/cm’ is plotted in Fig. 4 as a function of frequency f 
for the various structures. The current dependence of the diffused diode 
D3 is shown in Fig. 5; that for the other structures is similar. The 
vertical line on each curve to the right of the maximum indicates the 
frequency at which the real part goes to zero. At frequencies 15 to 25 
percent higher, (1 — 1/F')ima, goes through zero, being positive for 
lower and negative for higher frequencies. Asymptotically, at low 
frequencies (1 — 1/F’) imag is proportional to +f and at higher frequen- 
cies to —f’. The similarity of the curves is striking and suggests that a 
fit with a few parameters ought to be possible. The most meaningful 
expansion, and one working well here, is to consider F as an analytical 
function of a complex frequency variable and to expand it in terms of 
the poles and zeroes nearest the frequency domain of interest. At cur- 
rent densities of interest and for frequencies f from zero to a few times 
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ferit » the numerical results can be represented well by two poles and a 
zeYo; 


___(f—B) AA 
(if — A)(f — A) -B’ 


4 


(52) 





where ~ denotes the complex conjugate. A is the complex resonance 
frequency 


A = f, + if, . (53) 
We denote 
B= -f.. 


We utilize the following inequalities which hold for current densities of 
interest: 


la Ke (54) 
fr Kfe (55) 
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Fig. 4— Imaginary part of 1 — 1/F as function of frequency for various struc- 
tures. Current density 100 amps/cm?. The vertical bars indicate ferit. 
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Fig. 5— Imaginary part of 1 — 1/F as a function of current density for diode 
D3 at 100 and 1000 amps/cm’. 


and we consider frequencies small compared to f,. Then the reciprocal 
of (52) can be written 


a a ue _ f.) 
pein te ee 


Equation (56) has the frequency dependence embodied in Figs. 2 
through 5, i.c., the real part of 1 — 1/F changes as frequency squared 
and the imaginary part is the difference of a linear and cubic term in 
frequency. The low-frequency asymptote of the imaginary part is 


(l a= 1/F ) imag = S/he; (57) 





with 
ce 
1 + 2f.f-/f. 


The frequency, ferit , at which the real part of 1/F vanishes equals 
fr: 


Ja = (58) 
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Seri = Ts (59) 
and the frequency f; at which the imaginary part of 1/F vanishes is 
di aa Ve = ale . (60) 


Now let us consider the dependence of F on the direct current density 
I. Three relations hold approximately for the curves of Figs. 2 through 
5. 

The critical frequency ferit is proportional 
to the square root of direct current density. (61) 
The low-frequency, linear asymptote of (1 — 1/F)imaz 
is independent of current density. (62) 
The ratio of the frequencies fori, and f; at which 
the real and imaginary parts of 1/F cross zero 
is independent of current density. (63) 

We show now that these relations lead to the following current 

density dependences: 


fax I (64) 
fr « VI (65) 
f. independent of I. (66) 


Equation (65) follows directly from (59) and (61). Taking the ratio of 
(60) and (59) and using (63) we find that 


faf-/fr is independent of J. (67 ) 


Use of (62), (57), (58), and (67) yields (66). Finally, from (65), 
(66), and (67) we obtain (64). Thus, as long as the inequalities (54) 
and (55) and relations (61) through (63) hold, the current density 
dependence of F can be expressed as follows (see Fig. 6): With increas- 
ing current density the poles A and A of F move through parabolas, starting 
at the origin, while the zero at —f, 1s independent of current density. Where 
F,.and F,, deviate from each other, the representation (52) is still usable. 
The poles A and A are the same, but F, and F;, have separate zeros at 
—fee and —fen . 

The above discussion shows how the poles and zero can be obtained 
from calculated results of F as a function of frequency and current den- 
sity. In Appendix B a more direct way of obtaining the poles as solutions 
of an eigen-value problem is presented. The results are shown in Fig. 6, 
where the poles are plotted in the complex frequency plane with cur- 
rent density varying along the curves. The zeroes are listed in the insert. 
Only positive imaginary values are shown. Symmetry exists about the 
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Fig. 6— Locus of poles of F as function of current density in the complex 
frequency plane. Note unequal scales of real and imaginary axes. 


real axis. Note that the scales of real and imaginary axes differ by a 
factor of 10. 


V. NEGATIVE RESISTANCE AND MAXIMUM GROWTH FACTOR 


In this section, we consider the interaction of the avalanche and drift 
regions in causing negative resistance. The most important aspect of 
the current fraction F is the rapid change of phase, with respect to total 
current, from near zero below, to more than 180° above, the resonance 
frequency. This behavior follows from (52) and is illustrated for struc- 
ture D3 in Fig. 7 where the phase gy» of F is shown for current densities 
of 100 and 1000 A/cm?. At the higher current density the transition is 


IMPATT DIODES 1813 


more gradual. Likewise, for wider structures the transition is more 
gradual than for narrower structures. 

Let us consider a diode in which the hole drift region is negligible 
and for which the residual impedance Z, is negligible so that the dom- 
inant term in the impedance is 


Gin ise sah E ie pia ew (riers) |, (68) 


twC z, IwTL 


We want to explore under what conditions the small signal impedance 
has a negative real part. This will occur when the quantity 





1 — exp (—twrz) 
wry 


F = FG (69) 
has a phase angle between 0 and 180°. gg, the negative of the phase of 
G, is plotted in Fig. 8 as a function of frequency. We define as drift 
frequency, faritt , the frequency at which the phase gg is 90°, 


v 1 


2 (x0 = Lz) = Orr, ; (70) 


f drift = 
The singularity at twice the drift frequency occurs where G goes to 


zero and the phase is indeterminate. If the operating frequency is below 
1/rz we refer to the diode as operating in the + mode. The mode- 
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Fig. 7 — Phase of F for diode D8 at 100 and 1000 amps/cm?. 
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Fig. 8 — Negative of drift delay phase. 


designation for higher frequencies is given in Fig. 8. The condition on 
the phases for negative resistance can now be stated 


0 < or — ve < 180° (71) 


and can easily be visualized if Figs. 7 and 8 are combined. This has been 
done in Fig. 9 where a drift frequency of 60 GHz appropriate for the 
depletion layer of diode D3 has been used. Starting with the 100 A/cm? 
curve for gr, we see that negative resistance prevails over a wide 
frequency range, as indicated by the shaded region. Near f = 2farist 
the magnitude of G is small, hence the negative resistance contribution 
of FG is small and is likely to be outweighed by parasitic end resistance. 
At frequencies beyond 2farir, negative resistance occurs in higher-order 
modes. At frequencies much higher than fei the phase gr depends on 
frequency in a complicated way and the present calculations lose ac- 
curacy. As the current density is increased, the gr curve moves to the 
right with some softening cf the transition. The lowest frequency at 
which negative resistance sets in increases also, but a wide range of 
negative resistance continues to exist. The behavior thus far described 
is typical of a single-diffused p-n junction diode. 

Next, consider a typical “Read” structure, 1.e., one in which the 
drift space is much wider than the avalanche region. We consider the 
same avalanche zone as before but let the drift space be, say 8 microns 
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with a drift frequency of 6.25 GHz, as shown in Fig. 10. We now see 
that the frequency range for negative resistance in the 7 mode is much 
narrower; at 1000 A/cm’ there is no 7-mode negative resistance for this 
structure. 

Now let us consider structures with narrow drift regions. The extreme 
ease for which the drift regions coincide with the avalanche region 
is represented by p-i-n diodes.’ All three terms in (8) contribute, but 
the conclusions reached from a study of Zz alone are qualitatively 
correct. Negative resistance is obtained for a wide range of frequencies. 
For a consideration of the low-frequency behavior we expand F' to 
first order in f 








ef 
wi+tit {2 
Ff, (72) 
(cf. (57) and (58). With 
CSO oe vs 
5 TG MN) 
(68) yields 
1 Zhu) 
aS TS 4 
O Afaritt a ( ahs Me 
Thus, if 
Farite < 5 fe ; (75) 


negative resistance prevails from mz mode frequencies down to de. 
The case of the uniform-field avalanching plasma has been discussed 
by Misawa.° 

For an interpretation of the frequency f,, consider the change in 
electric field 5H, per change in total current in the limit as the frequency 
goes to zero, as shown in Fig. 11, for diode D3 at 100 A/cm’. The space 
charge associated with the additional current causes 6/ to be roughly 
hyperbolically shaped with the apex at the point where hole and electron 
currents are equal. The placement of the 6# curve with respect to zero 
is such that reduction of ionization in the center where 6 is negative is 
compensated by an increase in the adjacent regions. Outside the ava- 
lanche zone 6 changes linearly. At points A and B, separated by dis- 
tance diz the asymptotes to 6H outside the avalanche zone cross zero. 
From a consideration of (28) through (382) it can be seen that 1/f, is 
half the transit time between points A and B, 
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Fig. 9— Superposition of Figs. 8 and 9 appropriate for p-n junction case. 
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Fig. 10 — Superposition of Figs. 8 and 9 appropriate for “Read” diode case. 
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Fig. 11—Incremental field at low frequencies for p-n diode. 


dap 
fs = ag (76) 
If the avalanche zone is narrow compared to the drift region the integral 
over 6F is positive and so is the de incremental resistance. 

For a p-i-n diode (C1 at 100 A/cm’) the incremental field 5£ is as 
shown in Fig. 12. The asymmetry results from the inequality of hole 
and electron ionization coefficients. It is seen by inspection that the 
integral over 6H is negative and also that the transit time between A 
and B is less than r/2 times the transit time through the depletion 
layer (this is the condition corresponding to (75) when both left and 
right drift regions are considered), hence the resistance is negative. 
It is to be noted, however, that if equal hole and ionization rates had 
been used in this calculation, zero incremental de resistance would 
have been obtained. 

Up to this point we have considered the phase conditions for the oc- 
currence of negative resistance. Now let us consider the actual resistance 
or, more conveniently, the admittance Y. Fig. 13 shows plots of the 
imaginary vs real part of Y, with frequency varying along each curve, 
for diode D3, area = 10°* cm’, at various current densities. Fig. 14 
shows a corresponding plot for a Read structure R1 consisting of ava- 
lanche region Cl (1 micron wide, constant field) and an adjacent 
electron drift space 9 microns wide. The diode area is also taken as 
10°* cm’. The admittances in Figs. 13 and 14 are the reciprocal total 
impedance, but differ insignificantly from what would have been ob- 
tained with neglect of the residual impedance. 

Now let us consider the quality factor Q or the related growth factor g, 
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Fig. 13 — Admittance of diode D3. Selected frequencies, in GHz, are marked 
off on each curve. 
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Fig. 14 Admittance of Read diode R1. Selected frequencies, in GHz, are 
marked off on each curve. 
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1--H (77) 
which has the following meaning: If a diode having negative resistance 
is imbedded in a lossless circuit in which it oscillates at frequency f = 
w/2mr, then the absolute values of voltage and current amplitudes vary 
with time as exp (gwt). It is convenient to introduce the complex fre- 
quency variable s 


s= g + 2)f. (78 ) 


From (4), (52), and (53) it can be inferred that under open-circuit 
conditions the diode will oscillate at a value 


c= A = Ta + Uf y 
and, therefore, the open-circuit growth factor is 


Jopen = Fil Se . 
According to the current-dependence of Fig. 6, gopen 18 proportional to 
the square root of the current density. Note that under open-circuit 
conditions g depends on the avalanche structure only and is independent 
of the drift regions. This is not the case under more general circuit 
conditions. 

If the admittance Y is an analytic function of frequency, as is physi- 
cally reasonable and as we shall assume to be the case here, then the 
admittance is defined also for complex frequencies. The largest growth 
rate g(f) for reactive circuit imbedding at a given frequency f is that g 
for which 


1 


Yi(g za t)f\ = ong + FL = 


0 (79) 


if Y is capacitive; or 
YIg + 2)f] + 20g + 2)fC = 0 (80) 


if Y is inductive. Here L or C are chosen so as to resonate the imaginary 
part of Y. The quantity g thus obtained is related to the quality factor 
Q, as conventionally defined, by (77). So far, g is defined at a given 
frequency f and current density. We define as gmax the maximum of g 
(or maxima, where relative maxima exist) with respect to frequency 
at a given current density. Finally, we define as gop: the maximum of 
Jmax With respect to current density. We define by fimax the frequency (ies) 
at which gmax occurs. Figs. 15 and 16 show gmax and fmax aS a function 
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Fig. 15 — Maximum growth rate and open-circuit growth rate of diode D8. 


of current density for diode D3 and Figs. 17 and 18 for diode R1. Also 
shown for comparison are f, and Jopen - Included in Figs. 16 and 18 are 
horizontal lines marked fariss representing the drift frequency of the 
longest (= electron) drift region. An important result contained in 
Figs. 15 to 18 is that gop, is obtained for current densities at which 
the resonance frequency f, is about equal to the drift frequency and that 
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Fig. 16 — Frequency at which gmax occurs and f, for diode D3. 
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- Fig. 17 — Maximum growth rate and open circuit growth rate for Read diode 
1. 


fmax at this current density is some 20 to 30 percent above f,. For 
the diffused diode D3 the drift frequency is high (~ 60 GHz) and there- 
fore, gopt is obtained at a high current density, 3 X 10° amps/cm’. 
This current density is so high that a substantial widening of the ava- 
lanche region, with respect to its low-current-density configuration, 
has taken place. As a consequence, f, now is no longer independent of 
current density, but rather decreases with current density. Deviations 
from the low-current-density relations are also seen in the break- 
away from the line of slope $ for gopen in Fig. 15. 

For the Read diode R1 the largest gopt , occurring in the 7 mode, is 
obtained at 200 amps/cm’. The values of gmax in the nr mode fall off 
rapidly as the frequency approaches (n + 1)farire. For a Read-type 
structure, i.e., one having a substantial drift space in which negligible 
avalanche multiplication takes place, gop: is near 0.13 and is only 
weakly dependent of the detailed structure of the avalanche region. 
On the other hand, for diodes in which the avalanche region occupies 
all of, or a substantial fraction of, the depletion region gop: 18 closely 
related to the value of gopen and is larger than 0.13, as is the case for 
D8. 


VI. LARGE SIGNAL DESIGN CONSIDERATIONS FOR READ DIODES 


In Section V we obtained the result that the optimum growth rate 
under small-signal conditions occurs when farits & f,. This condition 
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provides a relation between current density, drift region width and 
avalanche region width (via f,). It is plausible to assume that the same 
relation should be approximately satisfied for best large-signal perform- 
ance. Preliminary large-signal results* have shown that a direct corre- 
spondence between the large-signal and small-signal properties does not 
exist and that, for example, large-signal self-sustained oscillation can be 
obtained for frequency-current-density combinations for which the 
small-signal resistance is positive. Nevertheless, as a point of departure 
for a large-signal design we choose a structure in which avalanche region, 
drift region, and current density are so related that gop: 1s obtained at 
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Fig. 18 — Frequency at which gmax occurs and f, for diode R1. 


the desired frequency under small signal conditions. This provides one 
constraint. 

Resonance frequency f; as a function of current density for the various 
structures is shown in Fig. 19. Suppose a 5-GHz operating frequency is 
wanted. We would need a resonance frequency about 25 percent lower 
or 4 GHz. Thus, structure Cl at 120 amps/cm?, or C5 at 400 amps/cm?, 
etc. could be used. The drift region would be 12.5 microns wide. 

Another constraint between average current density and frequency is 
imposed by the output limitations of the drift space. The following 
discussion applies to Read diodes having a constant-field drift region 
only. As shown by Read, under large-signal conditions the carrier cur- 
rent through the drift region is carried in the form of charge pulses. If 


*D. L. Scharfetter and H. K. Gummel, work in progress. 
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f is the frequency of operation and Jp¢ the average current density, 
then the charge per pulse is 





Ino 
= eC . 81 
: (81) 
This pulse causes a change in electric field of 
Ine 
AE = : 82 
@) a 


Let Hain, be the field in the drift region at the onset of breakdown; 
we shall use a value of 1.5 X 10° V/cm which is reasonable for silicon. 
AF must not be larger than approximately $ Haris,. The factor of 4 
takes into account that in the desired mode of operation the terminal 
voltage reaches its minimum while the charge moves through the drift 
region and that therefore an additional lowering of the drift field below 
Faris, occurs. Thus, we have the condition 


Ine < geHaritif. (83) 


This constraint is shown in Fig. 19 by the line of slope 1. Only the region 
to the upper left is allowed. For a diode to have a large power capability 
the highest feasible current density should be used. Thus, a design cor- 
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Fig. 19 — Illustration of Read diode design considerations. 
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responding to the circle appears most promising. The diode would operate 
at a current density of 300 amps/cm? and have an avalanche region 4 
microns wide. Note that the avalanche region width would be about 
2 of the drift region width. This is in contrast to Read’s proposal that 
the avalanche region should be very narrow in comparison to the drift 
region. However, the design suggested here is based on a combination 
of small-signal dynamics and large-signal output limitations and may 
have to be modified when large-signal results become available. 


APPENDIX A 


This appendix describes the steps by which self-consistent de solu- 
tions for electric field # and difference current density J, are obtained 
for a given terminal current density Jy. The electric field is the sum of 
a component Ep(x), related to the space charge of the impurities and a 
component 


Eic(x) = (ev/Is)a(x), 


where 2’ = I,/Is , related to the space charge of the mobile carriers. 

(i) Select a point a2; in the avalanche region and assign a trial value 
F for the electric field at this point. Conveniently, x; may be the loca- 
tion of the metallurgical junction, but the choice is not critical. 

(72) Choose a trial function for the difference current [4 . A conveni- 
ent, though crude, choice is 


Ih=Ts, 2st (84) 
h = —TIs, GT> My. (85) 
(iit) Compute the electric field E 
E(x) = Ey + : [ E + No(s) | ds. (86) 
€ Ja, v 


Extend the integral up to boundary points x, and zz to the left and 
right of 21, i.e., to points where the field reaches the value required to 
carry the current through the not swept-out semiconductor region: 


E(«1) = Is/[un | No(xx) |) (87) 
E(ar) = Is/[up | Nov(xr) | i (88) 
Define 


Hp(e) = By + 4f Nols) as (89) 
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Use 
zo(t) = [ Ia(s)/Tz as (90) 


as trial function for z. The beginning of the next step, (7v), is redundant, 
but is required for the iteration loop. 

(2v) Set H(z) = En(x) + (ev/Ts)z0(x), and In = Izzo’. Using these 
values, evaluate a, 6, and H, (29). 

(v) Solve 


[D?+ (@-—aD- Hz =at+ Bb — Hz (91) 


for z with boundary conditions 2’ = —1 at left boundary and z = +1 
at right boundary. 

(vt) Make z the new trial solution zo. 

(vit) Repeat steps (2v) through (vz) until z and zo differ negligibly. 
If the final field at x; differs appreciably from the trial field /, the bound- 
aries are incorrectly defined, and steps (z) and following should be re- 
peated with the current value E(2,) as trial field #,. This time use J, = 
z’Iy instead of (84) and (85) as trial function for J in step (iz). 

(viit) When the conditions of step (vz2) are satisfied, i.e., when z % Zo 
and E(a1) © £,, then H, I, = 2’Is, a, B, and H are the desired self- 
consistent de quantities. 


APPENDIX B 


In this appendix we show how the complex resonance frequency 
A = fa + tf, at which the impedance goes to infinity can be obtained 
directly from (31), rather than by curve fitting of results obtained at a 
set of imaginary s-values s = 0 + @f, or real frequencies. At resonance 
the normalized field z goes to infinity and therefore the right-hand side 
of (31) is negligible compared to the left-hand side, i.e., we have the 
homogeneous equation 


[D> — kh’ + (a+ 6)k+ (a — B)D — Hz = 0. (92) 


Solutions of (92) with boundary conditions (42) and (48) exist only for 
special values of k. The quantity k(k — a — 8) may be considered the 
eigenvalue of (92). Of the possible & values we select the complex 
conjugate pair of smallest absolute value. Let ko be the k with positive 
imaginary part. Then 


vkyo = 2r(fa + tf) (93) 


is the desired resonance frequency. 
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Numerical values of k may be obtained as follows. The differential 
equation (92) is replaced by a set of appropriate difference equations 
for the values z, of z at a set of meshpoints. This set of equations may 
be written in matrix form 


Me De = %, (94) 


where Z is a vector having the z, for its elements. The matrix M, being 
derived from a second-order differential equation, is a “tridiagonal 
matrix”’, i.e., it has nonvanishing elements only on the main diagonal 
and the two adjacent diagonals. Now k must be chosen so that the 
determinant of M vanishes. If ki and kz are trial values and the cor- 
responding values of the determinant are D,; and D,, then one may 
choose as next trial value 


_ Dolke — kx) 


Its = Deed: 


(95) 
and thus iteratively approach the desired value of ko . 

The frequency f, of the zero is chosen so that F has the proper phase 
in the neighborhood of the resonance frequency. Let F be the current 
fraction at the complex frequency s; = fa + tf1, where f; is near f, , 
then 


ff peat 
c= —~ Jae 96 
ol aera (96) 
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Some Examples of Comparisons of 
Connecting Networks 


By V. E. BENES 
(Manuscript received August 17, 1966) 


In the theory of telephone traffic it 1s of interest to compare the performance 
of connecting networks, as measured by the probability by blocking, when 
they are subjected to the same traffic sources. The question arises whether 
there are examples of patrs of networks, with the same number of cross- 
points, whose respective graphs of loss as a function of offered load cross 
each other. The existence of such examples would establish the princtple 
that some network configurations are inherently more efficient at some traffic 
levels than at others, so that the ‘‘excellence” of a network is not necessarily 
a purely combinatorial notion independent of offered traffic. Examples of 
the above phenomenon are exhibited which do not involve only very small 
networks. 


I. INTRODUCTION 


In the theory of telephone traffic it is of interest to compare the 
performance of connecting networks, as measured by the probability 
of blocking, when they are subjected to the same traffic sources. Natu- 
rally, there are cases in which the result of this comparison is inde- 
pendent of the calling rate \.! In this connection, H. O. Pollak has raised 
the question whether there are examples of pairs of networks, with the 
same number of crosspoints, the first of which is better than the second 
at one value of \, while the second is better than the first at another 
value of 2. 

The existence of such examples would establish the principle that 
some network configurations (in particular, some switch sizes) are 
inherently more efficient at some traffic levels than at others, so that 
the ‘‘excellence” of a network is not necessarily a purely combinatorial 
notion independent of offered traffic. We shall exhibit examples of the 
above phenomenon which are nontrivial in that they do not involve 
only very small networks. 
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II. PRELIMINARIES 


The notations and conventions of Refs. 2 and 3 will be used. We 
shall need machinery for studying the probability of blocking at very 
high values of the traffic \; this is provided by the natural expansion 
of the equilibrium state probabilities in inverse powers of ): 


Lemma: The state probabilities {pz ,x € S} can be expanded in a power 
series 


pe =D da(z)X™ (1) 


valid for real and sufficiently large. With w = max | x | , the coefficients 
res 


Adm(x) have the property 


d,(z) =0 for OS m<w-—|al, (2) 
and the numbers dy—\2\(7) satisfy 
(3) 
S(y)du—iy\(y) = > duva(z), ly|<w, 
Aja) (x) = 0. (4) 


Proof: pz(d) is a rational function of \, and so has an expansion of the 
form (1) if \ is large enough. Substitution of (1) into the equilibrium 
condition gives these equations for the coefficients d(-): (No unblocked 
call is rejected.) 


| x | Omi (2 ) 7 8(2)dm(x) me a Ams(y) a > dn Y)T yx. 


It follows at once that if 0 = zero state (with no calls up), then d)(0) = 0, 
and 


s(x)do(a) = >) dol(y)tye, 


yeBs 


so that do(z) = 0 unless x is maximal in the natural partial ordering 
of states. 
Thus, if x is not maximal then 


dn(x) = 0 for |x|<w—m 


holds for m = 0. Assume that it holds for some m — 1 = O. For x not 
maximal, s(x) > 0 and 
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s(x)dn(t) = — |a|dma(a) + 2, Imly) + 2, dm(Y) Tye 


yeBy 


If |x| < w— m, then dns(z) = O and y € Az implies dn_s(y) = 0, both 
by the induction hypothesis. Thus, dm(x) is expressible as a constant 
times dn(0). But 


s(0)dn(0) = pa inal) = 


by the induction hypothesis. 
If a is maximal with |x| < w — m, then 


| x | dm( 2) _ 2, AmtiCYy Tye y m = 0. 


But y « Bz implies | y| = |2| — 1 < w — m — 1, and so daily) = 0. 
(1) and (2) imply (8) and (4). 
The formula 


Ds d(x) = 


|z |=w 
follows from 


m=w—|z | 


and Dae pz = 1 by letting \ > o~. 
It follows from the lemma just proved that for sufficiently high values 
of the traffic parameter \, the probability of blocking has the form 


2 Brdy—k (x ) 





Pr {bl} = eit OGL). - Ae, 
Do We ae 
lz |>k j=w—|z | 


where & is the greatest integer such that some states with & calls in 
progress have blocked calls (8, > 0). In particular, we see that 


0 if kK<w 
lim Pr{bl} = 


A> 00 


IlI. COMPARISONS 


The examples to be studied are the networks A and B in Figs. 1 and 
2, respectively. Both are three-stage networks of the type due to C. 
Clos,‘ each with nr inlets (outlets). We show that there are values of 
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m, n, and r such that (7) A and B have very nearly the same number 
of crosspoints, and (iz) A has lower blocking than B at all sufficiently 
low values of the traffic }, while B has lower blocking than A at all 
sufficiently high values of \. The calculations forming this comparison 
will be carried out in the traffic model of Chapter 8 of Ref. 2; familiarity 
with this model is assumed. 

In A, at least m calls must be in progress in order for there to be any 
blocking. Hence,® 


Pr{bl}4 = «r” + o(d”), A — 0. 
In B there is a least integer k = O such that 
Pr{bl}z = cd” + 0(\’), 0 


with c > 0. We shall show that k S r + 1, independently of the routing 
used to run B. It has been shown’ that the probability p, of a state in 
the model of Ref. 2 is of the form 


_ $n) +121] 1 

Ps = po Ds tee’ e) 

where the sum is over paths 7 on (S, S) permitted by the routing rule 

in use starting at 0 and ending at zx, the product is along the path, and 
U(ar) is the path-length. 

In B it takes r calls in progress to block a call. Choose an outer switch 

on each side of B and consider a sequence of r attempted calls, each of 

which is from one of these switches to the other, together with one 





Fig. 1— Network A: 2mnr + mr? crosspoints. 
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rxr nxn rxr 





Fig. 2— Network B: 2nr? + rn? crosspoints. 


more call c. The last call, c, will have to go on one of the 7 middle switches 
each of which already has exactly one call. (Fig. 3, upper half.) If now 
the other call on the switch carrying c hangs up (Fig. 3, lower half), we 
will have reached a blocking state from 0 with positive probability 
along a path z of length U(r) = r + 2. Since the blocking state reached 
has r calls in progress, there is a contribution in formula (5) of the form 


on c>0. 
It follows that if m > r+ 1, then 
Pr{bl}4 < Pr{bl}, 


for all \ sufficiently small. 
Now take n > m, so that Lemma 1 gives 


= Bxdo(x) 
PUD aie ey 


|z|=nr 


1+ o(1), A> 0, 


I 


At the same time, it can be seen that in network B, 6. = 0 for |x| > 
nr — 2, so that 


23 Bxtla( x) 
Pri{bl}s = leer 


2 
Az > d;(x)x”? 


|x] >nr—2 j=nr—|z| 





+ o(1). 


For |a| = nr, 
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| x | dy(x) = 2 di(y) rye. 


Thus, 
mr = D0 |eld(e) = 2 2 dily)r = oe yey) 
= ys dy(y), 


lyJ=nr—l 


and since a, = 0 for |x| = nr, and a, = s(y) = 1for|y| = mr — 1. 


foe B.ds(2z) 
is | 2, avea( x) + . a, ay) ae 


The leading term is <1, and so for all \ sufficiently large 
Pr{bl}s < Pr{bl}.. 


rxr nxn rxr 





\ 
THIS CALL HANGS 
UP, LEAVING 


rxr nxn rxr \ 





Fig. 3— Blocking state of A reached in r + 2 steps (only links in use in- 
dicated). 
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It remains to show that there are values of m,n, r such that n > m > 
r + 1 for which the number of crosspoints of A is very nearly equal to 
that of B. Picking m = r + 2, the condition for equality is that 


n=24 (44+ 2+}, 
With [¢] the integer part of t, we pick n as 
24+[((4+2+r7)>24+0+2r+r=r+3>m. 


With this choice of » A actually has more crosspoints than B and yet 
gives higher blocking at large values of \ than B does. 
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Display of Holograms in White Light 
By C. B. BURCKHARDT 
(Manuscript received September 7, 1966) 


This paper describes a new method for displaying holograms in white 
light. The method gives reasonably good reconstructions although cer- 
tain image defects are inherent in the method. It differs from previously 
reported methods of white light reconstruction” * in that the whole spec- 
trum is used for reconstruction and therefore black and white reconstruc- 
tions can be obtained. The method does not depend on the volume prop- 
erties of the photographic emulsion. 

The basic arrangement is shown in Fig. 1. The white light illuminates 
a hologram which had been formed with a plane off-axis reference beam.? 
Behind the hologram there is a Venetian blind structure which blocks 
off the direct light but lets through the diffracted beam. The diffracted 
beam is diffracted a second time at a plane transmission grating which 
can be formed photographically with two plane beams. The angle be- 
tween the two beams which form the plane grating has to be equal to 
the mean angle between the subject beam and reference beam used to 
form the hologram. 

The reconstruction resulting from the configuration of Fig. 1 will now 
be explained. Intuitively, one can say that there is a large color dispersion 
at the first hologram because it can be considered a high spatial frequency 
diffraction grating. Since the light is diffracted in the opposite direction 
by the second grating this color dispersion 1s compensated. In order to 
be more quantitative, assume that during the formation of the hologram 
the subject beam A, on the photographic plate is given by 


A, = a(x,y) exp (just), (1) 


where w, is the mean radian frequency of the subject beam and the 
center of the spatial frequency spectrum of a(x,y) is at zero. Assume that 
the reference beam F is given by 


R = B exp (—ju,2), (2) 


where B is the amplitude and —w, is the radian frequency of the refer- 
ence beam. The photographic plate will record the intensity? and for the 
virtual image term we obtain 
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Fig. 1— White light display of a hologram. 


A, R* = a(x,y)B exp (j[ws fn w,]2x), (3) 


where the star means complex conjugate. 

The plane grating is formed by two plane wave beams which can be 
expressed as G exp (jw.x) and G exp (—jw,2) and for the intensity xx* 
on the plate we will obtain 


xx* = 2G? + G? exp (jlws + |x) + G? exp (—jlw, + w-Jx). (4) 


We now assume that the two plates are developed in such a way that 
their amplitude transmittance is proportional to the intensity during 
exposure. Since the hologram and the plane grating are spaced only a 
short distance behind one another, to get the amplitude transmittance 
corresponding to the virtual image we can multiply the amplitude trans- 
mittance of the hologram, A,-R*, with that of the plane grating yx*. 
We are now interested in the term which is given by the product of (8) 
with the last term of (4). This product, 


a(x,y)BG?, (5) 


is equal to the subject beam term, (1), translated to a center frequency of 
zero. This term, therefore, represents the reconstruction of a virtual image 
in the direction of the illuminating beam. It is possible to make a single 
hologram where the virtual image term has the form of (5). One chooses 
a subject beam as given in (1) and a reference beam B exp (jw,2), 1.e., 
the reference beam has the same mean direction as the subject beam. This 
is the on-axis hologram which has the well-known disadvantage that the 
direct beam, the real image, and the virtual image fall onto each other. 
The configuration of Fig. 1 does not have this disadvantage, with respect 
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to the virtual image; it is, however, in a sense ‘“‘equivalent”’ to the on- 
axis hologram. Therefore, the following explanations will be in terms of 
the equivalent on-axis hologram. 

In the on-axis hologram each object point forms its own on-axis Fres- 
nel zone plate on the photographic plate. Upon illumination each zone 
plate forms a divergent spherical wavefront and therefore, a virtual 
image point. Since the focal length of the zone plate is inversely propor- 
tional to the wavelength of the illuminating light, the virtual image 
points for the different colors are staggered in depth. It is important to 
note that one does not perceive this difference in depth if one looks at the 
virtual image point through the center of its zone plate and if the eye has 
sufficient depth of field as is usually the case. For a particular point of 
observation, a region of image points will be approximately “on-axis” 
and this region will be in sharpest focus and have minimum color. As 
the eye is moved different regions of image points will come into sharp- 
est focus. 

It is possible to use to advantage a spherically converging reference 
and illuminating beam of the same curvature. A little thought will show 
that by placing the eye at the focal point of the reference beam one can 
look at all the virtual image points through the centers of their respective 
zone plates. From this point one therefore sees an image which could be 
called ‘‘quasi-achromatic”. (This can also be shown by using imaging 
formulas.’ They show that for the point mentioned, the ratio between 
the eye-to-image-point distance and lateral image magnification is in- 
dependent of wavelength.) If one moves the eye away from this point 
the image starts to blur. 

Experimentally, it was found that the best reconstructions were ob- 
tained by using a slightly convergent reference beam and viewing the 
reconstruction from a point in front of the focus of the reference beam. 
This is probably so, because one then has more tolerance with respect to 
movement of the eye. 

It is, of course, possible to place the plane grating in front of the holo- 
gram in the configuration of Fig. 1. Particularly bright reconstructions 
are obtained by using the first diffracted order of a blazed reflection 
grating for illuminating the hologram. The reconstruction can then be 
easily viewed against a background of ordinary room light. 

Fig. 2 shows a photograph of the virtual image of a white light re- 
construction. The lens used to form a convergent reference and il- 
luminating beam has a focal length of 48 cm and was placed 18 cm in 
front of the hologram plate. The image-forming photographic lens was 
placed at the focal point of the reference lens where the image is quasi- 
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Fig. 2— Photograph of the virtual image with the image-forming lens at the 
quasi-achromatic point. 


achromatic. The photographic lens had an aperture of 6.5 mm and a focal 
length of 17 cm. The distance between subject and plate during the for- 
mation of the hologram was 18 cm and the angle between the subject 
beam and reference beam was 22°. The angle between the two plane 
beams used to form the photographic grating was also 22°. The Venetian 
blind structure is not visible in Fig. 2 because it was out of focus. A 
zirconium arc lamp was used as white light source. 

The author would like to acknowledge the very competent experimen- 
tal assistance of E. T. Doherty. 
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Approximation of the Error Probability in a Regenerative 
Repeater with Quantized Feedback 


By M. R. AARON and M. K. SIMON 
(Manuscript received September 21, 1966) 


I. INTRODUCTION 


Recently, Zador! gave a clever functional iteration procedure for deter- 
mining the error probability in a binary regenerative repeater with quan- 
tized feedback. Unfortunately, quantitative results for the long pulse se- 
quences of interest are difficult to come by due to the prohibitive amount 
of computer time required to carry out the iterations. We have found a 
simple approximate procedure that breaks the computational bottle- 
neck in all cases of practical interest. The crux of our approach is the ap- 
proximation of the functional iteration by a difference equation. For 
clarity, we use only a few terms of a Taylor series in establishing the dif- 
ference equation approximation. More terms can be used to obtain a 
better approximation if needed. 


II. RECAP OF ZADOR’S WORK 


In Ref. 1, Zador shows that the kth iterate of the transformation 
Uf(x) — denoted by U*f(x) = U*"[Uf(x)] — when evaluated at x = 0 
yields the average bit error probability p(k), for the last bit in a random 
sequence of k + 1 bits processed by a regenerative repeater with quan- 
tized feedback. The transformation Uf(x) is given by [Zador’s (14)] 


Uf(x) = prla)f(rw — a) + palx)f(rz) + ps(x)f(rx + a), (1) 


where 


I 


pIN(—go — x)] 
p(x) = 1 — pix) — ps(x) 


plz) 





p(x) = afl — Ngo — 2)] 
f(x) = pilx) + ps(x) (2) 
N(x) =~ om [. exp (—y'/20°) dy 
0O<r<il 
a= —291 P 


In (2) above, r = exp (—T/T.) = exp (—b) is the decrement of the 
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simple RC low-pass filter in the feedback path of the regenerative re- 
peater; 7’, is the time constant of this filter or equivalently the reciprocal 
of the 3-dB point on the high-pass filter in the transmission line preceding 
the repeater. The quantity go is the peak value of pulse response of the 
medium in response to the transmission of a +1, and gi < 0 is the value 
of the isolated pulse response of the medium one time slot (7) away from 
the peak. For convenience, we assume equally likely transmitted pulses, 
p=q=i. 


IJ. APPROXIMATION 


If we (2) substitute for p(x) from (2) into (1), and (zz) expand f(rx + a) 
in a Taylor’s Series about rz, and (777) retain terms in the series through 
a? we get the approximation 


Use) = fore) + aE) tae) = rile) 








* aFy) - 
ms 74 __, Pile) + pile) 
When we note that 
CUD)  -2c0t Som asada 
dy” | y=0 
p3(0) = pi(0) (4) 
1(0) = p(0) + px(0) = p(0) = [1 — erfe (; walt 
then 


2 372 

Z a df(y) | 

pa) = po) [1+ $2) |. (5) 
Proceeding through the functional iteration defined by (8), retaining 
terms involving a? and lower, and using (4) we obtain the difference 
equation 








p(k) = p(k — 1) + pO) Krer™, (6) 


where Z = g)/c is the peak signal to rms noise ratio for an isolated pulse, 
and 


K= WE (2) exp (—4Z°). (7) 
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Equation (6) is easily solved to give 


p(k) = p(0) E + K 2, ial (8) 
Then 
lim p(t) = p(o) [1+ a]. (9) 


The bracketed term in (8) or (9) gives the enhancement of error proba- 
bility due to errors made on all previous pulses. 

To compare results obtained by application of (9) with those obtained 
experimentally by R. D. Howson,? we take b = 43, go = exp (—#), and 
g. = —go(l — exp (—3)). Over a wide range of S/N ratios of interest, 
analysis based upon (9) predicts about a 0.6-dB S/N penalty of this 
system over the ideal case of no low frequency cutoff. Agreement with 
experiment is excellent. It should be noted that the enhancement term 
in (9) is very close to unity and the S/N penalty is due essentially to 
the reduction of the pulse peak by the low-frequency cut-off. 

In a future paper we will show (2) how Zador’s approach can be ex- 
tended to a wider class of systems, and (zz) how the approximation given 
herein can be used and improved when necessary, to arrive at meaning- 
ful quantitative results. 
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Application of Automatic ‘Transversal Filters 
to the Problem of Echo Suppression 


By F. K. BECKER and H. R. RUDIN 

(Manuscript received October 6, 1966) 

Long-haul voice communication has long been subject to the problem 
of returned echo. The advent of synchronous satellite communication 


introduces increased delay as a degrading factor in the overall quality of 
two-way conversation. This compounds the problem in that the echo 
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must be further attenuated if its annoyance level is not to be increased 
as the delay of the returned echo increases. 

At the present, the problem of returned echo is alleviated by the in- 
sertion of attenuation into the path of the weaker signal, i.e., the echo 
path.! Given the larger delay inherent in synchronous satellite communi- 
cation, a better technique is wanted. One attractive scheme is the use of 
a transversal filter to synthesize a replica of the echo, which is then sub- 
tracted from the actual echo so that the two signals cancel. An algorithm 
which allows this synthesis to be carried out automatically was dis- 
covered by B. F. Logan and the late J. L. Kelly, Jr. 

A scheme for simulating echoes and the technique for suppressing them 
are shown in block form in Fig. 1. The person using the handset at the left 
of the figure would experience echo; no echo is simulated for the person 
using the handset on the right. The echo, which would normally be 
caused by an improperly balanced hybrid, was instead simulated using 
various linear networks. A tape recorder simulated the long delay re- 


n(t) 





= = SUMMER T=DELAY LINE OF LENGTH T 
ATT =ATTENUATOR C.C.= CROSS -CORRELATOR 


Fig. 1—System for demonstration of automatic transversal echo suppressor. 
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sulting from satellite transmission. The automatic transversal filter is 
enclosed in dashed lines; its input is the delayed speech from 7 and its 
function is the minimization of the echo delivered to R; . The signal from 
T. merely appears as noise, n(¢), in this minimization procedure. 

Mathematically, the transversal filter’s function is the minimization 
of the mean-squared value of the echo delivered to R; . This is achieved 
through the approximation of e(¢) by the sum of the weighted, delayed 
versions of s(t). Thus, the canceler strives to minimize 


PS [ et) — > cjs(t — in) | dt, (1) 


where c; is the weighting (attentuator setting) associated with the jth 
tap and 7 is the tap spacing, usually the reciprocal of twice the highest 
frequency of interest. 

The attenuator settings (which may be positive or negative) can be 
calculated by partial differentiation of I by the various c,’s. Specifically, 


OC; 


aL _ —2 i | et — > est — in) | (s(t — kr)] dé. (2) 


Note that the second equation states that the partial derivative of J 
with respect to the tap gain c is given by the cross-correlation of the 
signal at the kth tap on the delay line with the signal delivered to R,. 
The optimum settings for all the tap weighting coefficients occur when 
all the partial derivatives are zero. 

Assuming a reasonable spectrum for the signal s(é), it can be shown that 
the integral J is a convex function of the tap gains. Given this fact, the 
information contained in the various partial derivatives is sufficient to 
point the way toward the unique minimum of I. 

An experimental implementation of the echo canceler is built around 
a general-purpose automatic equalizer intended for the reduction of 
linear distortion in communication channels.? The attenuators are digi- 
tally-controlled, resistive ladder-networks. In the implementation, the 
information obtained by cross-correlation is used to increase or decrease 
the attenuator setting by a constant increment. The cross-correlation 
coefficients are then recalculated and the attenuators again changed. The 
attenuators are permitted to change their setting only when the s(t) 
signal exceeds a threshold. The attenuators have infinite memory and so 
retain their setting during long lapses in the speech originating at 7, . 
A simple RC low-pass filter provides a sufficient approximation to the 
integration indicated in the second equation. 
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The signal originating at 72, despite the fact that it may well be 
several times the size of the echo, e(é), produces only small perturbations 
in the attenuator settings. This is a result of the powerful cross-correla- 
tion detection used to set the attenuators. In the experimental imple- 
mentation described, a second speaker at 7’. did not perceptibly degrade 
echo suppression. 

Another feature of the system is that it is inherently adaptive. If the 
characteristics of the transmission channel should change, the scheme 
automatically makes the necessary modifications as long as the speaker 
level is above the threshold. This is true provided that the change in the 
channel characteristics occurs at a very slow rate. 

Early results from this implementation indicate that echo suppression 
of some 20 dB is attainable. Further evaluation is necessary to accurately 
predict behavior on real channels. The settling time for this experimental 
echo canceller was in the order of two seconds. The settling time is de- 
pendent on the echo-to-interfering-noise ratio. 

An accompanying brief by A. J. Presti and M. M. Sondhi describes a 
different implementation of an echo canceller based upon similar prin- 
ciples. 

There are a number of engineering problems which must be solved 
before adaptive echo cancellation becomes a practical reality. One of 
these is the long, distortion-free delay which the echo canceler must sup- 
ply. The magnitude of this delay depends on the echo delay. Another 
problem arises when there are several reflection points (hybrids) in the 
echo path. A third problem is that the apparatus tracks a change in 
echo path only if the change is slow and the signal threshold is exceeded. 
Given the solution to these problems, however, the future of this tech- 
nique of echo cancellation is a promising one. 


REFERENCES 


1. Holman, E. W. and Suhocki, V. P., A New Echo Suppressor, Bell Laboratories 
Record, April, 1966, pp. 189-142. 

2. Lucky, R. W. and Rudin, H. R., Generalized Automatic Equalization for Com- 
munication Channels, Proc. IEEE, March, 1966, pp. 439-440. 


B.S.T.J. BRIEFS 1851 


A Self-Adaptive Echo Canceller 


By M. M. SONDHI and A. J. PRESTI 
(Manuscript received October 14, 1966) 


Conventional echo suppressors combat echoes generated at hybrid 
junctions in long distance telephone connections by interrupting the 
return path according to some decision scheme based upon the relative 
levels of the outgoing and return signals. In this brief, a new device is 
described for cancelling the echo without interrupting the return path. We 
call this device an echo canceller to distinguish it from conventional echo 
suppressors. It generates a replica of the echo (which is then subtracted 
from the return signal) by synthesizing a linear approximation to the 
echo transmission path. It is self-adapting in that it automatically tracks 
variations in the echo path which may arise during a telephone conversa- 
tion (e.g., connection or disconnection of extension phones, etc.). 

A schematic of such a self-adapting echo canceller is shown in Fig. 1. 
It is based upon an idea originally proposed by J. L. Kelly, Jr. and B. F. 
Logan, and incorporates modifications which simplify and improve the 
implementation and performance. 

With reference to Fig. 1, let x(¢) be the input speech signal and y(é) 
the return signal. The return signal consists of an echo z(t) (which is the 
result of convolving x(t) with the impulse response h(¢) of the echo path) 
and a noise n(t) (which may include a second speech signal). An estimate 
4(t) of z(t) is substracted from the return signal and the error signal 
e(t) = y(t) — &(t) is continuously used to improve this estimate. The 
signal 2(¢) is given by a linear expansion with time varying coefficients, 
gi(t). Thus, 


Ht) = D altel), (1) 

where 
wt) = [ wi(r)e(t — 7) dr, (2) 
and the w,(r) (¢ = 1, 2, ---) form a complete set of orthogonal functions. 


The dynamic behavior of the system of Fig. 1 is then governed by the 
set of equations 


Sgt) = Klett) i= 1,--- NW. (3) 


Here K is a positive constant and the function F[e] is chosen to be an odd 
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Fig. 1— Schematic of echo canceller. 


and non-decreasing function of the error e. It can then be shown that if 
h(é) can be well approximated by the expansion 


nt) Dhani, (4) 


where the h; are constant or slowly varying, then g,(t) converges to 
hi @ = 1, --- N) in the limit as t > «. That is, 2(t) approaches <(é). 
The quantity 


2d (gi(t) —hi)” 


converges monotonically to zero if n(t) = 0 and if the expansion given 
by (4) is exact. However, convergence takes place even in the presence 
of relatively large amounts of noise. The amount of noise that can be 
tolerated decreases as the speed of convergence increases. The speed of 
convergence depends upon the constant K, the choice of function F’, and 
the level and statistical properties of the signal x(t). The proof of con- 
vergence, estimates of convergence rate, factors affecting choice of F, 
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and the set w.(t) as well as results of computer simulations of such echo- 
cancellers are the subject of a paper under preparation. 

A prototype of such a system has been implemented. In this imple- 
mentation the 2,(¢) are obtained from taps on a delay line so that 
w(t) = 6(t — tT), where T is approximately 0.1 msec and N in (4) is 
50. The function F has been chosen to be an infinite clipper. This allows 
the multipliers 4/,; in Fig. 1 to be replaced by simple transistor switches. 
The integrators also pose no particular problem and simple operational 
amplifiers were found satisfactory. However, the multipliers M.; must 
satisfy more stringent requirements. Their outputs g;(t)z;(t) must be 
strictly proportional to the x;(t), while strict proportionality to the g;(t) 
is of secondary importance. A four quadrant multiplier was designed 
using current-controlled photo-resistors in a feedback circuit. This cir- 
cuit exhibits a nonlinearity of less than 10 percent with respect to the 
gi(t) and less than 0.5 percent with respect to the x;(¢t) over a 50-dB 
dynamic range. 

During time intervals when the input x(¢) is zero, there is no corrective 
feedback and it is important that the gain settings g;(¢) be unaffected by 
drift in the integrators. It is also desirable (though not absolutely essen- 
tial) that the feedback path be opened when the noise n(é) (which, as 
previously noted, may include a second speech signal) is considerably 
larger than the echo. The box designated as speech detector in Fig. 1 
achieves this dual objective. It disconnects the operational amplifiers 
from the integrating capacitors whenever 


(le™|)— (ly¥@|)<e 


where ¢ is a predetermined positive threshold and (_ ) indicates time 
averaging for about 0.5 sec. 

Further details of the implementation will be described in a forthcom- 
ing publication. The results of computer simulations and tests on the 
prototype may be summarized as follows: 

(2) The system converges in about 0.5 second for normal speech 
levels. This convergence time increases to about 5 seconds for a speech 
signal 20 dB lower. 

(77) When the echo canceller and the echo path were simulated on the 
computer, cancellations of 60 dB and higher were easily achieved. How- 
ever, in the case of echoes generated on laboratory simulated telephone 
connections, the cancellation was about 20-25 dB on both the computer 
simulation and the prototype. 

(zit) The system converges in the presence of noise which is up to 8 or 
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10 dB higher than the echo. After the system has converged, however, 
even a much larger noise does not appreciably degrade the cancellation. 

The implementation must be extensively tested in a variety of tele- 
phone circuits before the merits of the proposed system can be fully 
evaluated. 

An accompanying brief by F. K. Becker and H. R. Rudin describes a 
different instrumentation of an echo canceller based upon similar prin- 
ciples. 


